About the Role
We are looking for a Platform Engineer to help build, scale, and maintain the CogitX platform, a highly extensible, planet-scale, enterprise-grade system designed to support complex workflows, integrations, and AI-driven capabilities.
You will play a critical role in shaping the platform’s infrastructure, developer experience, and core services to ensure reliability, scalability, and performance at scale.
Key Responsibilities
Platform & Infrastructure
- Design, build, and maintain scalable, secure, and highly available platform infrastructure
- Manage containerized workloads using Kubernetes
- Implement and optimize networking, ingress, and traffic routing (NGINX, API Gateway, Front Door, etc.)
- Ensure high availability, fault tolerance, and disaster recovery readiness
Developer Platform & Tooling
- Build internal developer platforms and tooling to improve engineering productivity
- Design CI/CD pipelines for automated build, test, and deployment workflows
- Enable seamless local-to-cloud development environments
Cloud & DevOps
- Manage cloud infrastructure using Infrastructure as Code (Bicep, Terraform, ARM)
- Optimize cost, performance, and resource utilization across environments
- Implement secure access patterns (RBAC, Managed Identity, Private Endpoints)
Observability & Reliability
- Implement monitoring, logging, and alerting systems (Prometheus, Grafana, Azure Monitor, etc.)
- Define and track SLIs/SLOs for platform reliability