Posted on: 04/02/2026
Description : Job Description: Software Development Engineer 2 (DevOps & Infrastructure)
About Client :
It is a fast-growing, Series A funded health-tech startup dedicated to making healthcare affordable and accessible. We provide a transparent platform for high quality substitute medicines, helping consumers save significantly on chronic healthcare costs.
Experience : 4 - 6 year
Employment Type : Full-time (Bangalore)
Responsibilities :
- Infrastructure as Code (IaC) : Lead the transformation of existing cloud infrastructure into fully automated code using Terraform, Pulumi, or CloudFormation.
- Architecture Evolution : Lead the evolution of our infrastructure to handle 10x traffic growth, focusing on auto-scaling, high availability, and multi-region resilience.
- Cost Optimization : Analyze and optimize cloud infrastructure spending by right-sizing resources, implementing spot instances, and identifying cost-saving architectural trade-offs.
- Advanced Deployment Pipelines : Lead the design and implementation of Canary pipeline setups to ensure zero-downtime releases and safe feature rollouts.
- Lifecycle Management : Ensure the entire tech stack, including Kubernetes versions, database engines, and OS images, are regularly updated to the latest stable versions to mitigate security risks.
- Guardianship of Quality : Lead and participate in Technical Specs authored by other engineers, providing critical feedback on infrastructure security and cost-efficiency.
- Uptime & Reliability : Own the "uptime" of our platform by implementing advanced monitoring, automated failovers, and robust disaster recovery strategies.
- Security & Compliance : Ensure that all cloud infrastructure adheres to industry standards for data security, network isolation, and privacy.
Requirements :
- 4+ years of experience at high-growth product companies (Tier-1 startups or Big Tech) with a focus on high-scale infrastructure and cloud operations.
- Proven track record of moving manual infrastructure to code and managing complex state files in a production environment.
- Deep hands-on knowledge of AWS, Kubernetes (EKS), and Docker for managing microservices at scale.
- Experience building and optimizing high-speed CI/CD deployment pipelines for both backend and frontend applications.
- Experience setting up and managing observability frameworks such as Prometheus, Grafana or Datadog.
- Deep experience with AWS networking (VPC, Peering, Transit Gateway, Route53) and message brokers like SQS/RabbitMQ.
- Good to have experience managing high-concurrency data stores like ClickHouse and setting up robust data pipelines
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1609612