Posted on: 08/01/2026
Description :
Location : Gurgaon.
Department : Product and Engineering.
Working Days : Alternate Saturdays Working wfh (1st and 3rd).
Key Responsibilities :
- Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
- Build and manage Kubernetes clusters (EKS, self-managed) to ensure reliable deployment and scaling of microservices.
- Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure
provisioning.
- Containerize applications and optimize Docker images for performance and security.
- Ensure CI/CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure
deployments.
- Drive SRE principles including monitoring, alerting, SLIs/SLOs, and incident response.
- Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
- Automate routine tasks with scripting languages (Python, Bash, etc.).
- Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
- Collaborate closely with development teams to enable DevSecOps best practices.
- Participate in on-call rotations, handle outages with calm, and conduct postmortems.
Must-Have Technical Skills :
- Kubernetes (EKS, Helm, Operators)
- Docker & Docker Compose
- Terraform (modular, state management, remote backends)
- AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS/EKS)
- Linux system administration
- Database tuning based on hardware config.
- CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions)
- Logging & monitoring tools : ELK, Prometheus, Grafana, CloudWatch
- Site Reliability Engineering practices
- Load balancing, autoscaling, and HA architectures
Good-To-Have :
- GCP or Azure exposure
- Security hardening of containers and infrastructure
- Chaos engineering exposure
- Knowledge of networking (DNS, firewalls, VPNs)
Soft Skills :
- Strong problem-solving attitude; calm under pressure
- Good documentation and communication skills
- Ownership mindset with a drive to automate everything
- Collaborative and proactive with cross-functional teams
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1598257