Posted on: 18/08/2025
Experience Required : 4-6 years.
Location : Gurgaon.
Department : Product and Engineering.
Working Days : Alternate Saturdays Working (1st and 3rd).
Key Responsibilities :
- Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
- Build and manage Kubernetes clusters (EKS, self-managed) to ensure reliable deployment and scaling of microservices.
- Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure provisioning.
- Containerize applications and optimize Docker images for performance and security.
- Ensure CI/CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure deployments.
- Drive SRE principles including monitoring, alerting, SLIs/SLOs, and incident response.
- Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.
- Automate routine tasks with scripting languages (Python, Bash, etc.
- Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
- Collaborate closely with development teams to enable DevSecOps best practices.
- Participate in on-call rotations, handle outages with calm, and conduct postmortems.
Must-Have Technical Skills :
- Kubernetes (EKS, Helm, Operators).
- Docker & Docker Compose.
- Terraform (modular, state management, remote backends).
- AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS/EKS).
- Linux system administration.
- CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions).
- Logging & monitoring tools : ELK, Prometheus, Grafana, CloudWatch.
- Site Reliability Engineering practices.
- Load balancing, autoscaling, and HA architectures.
Good-To-Have :
- GCP or Azure exposure.
- Service Mesh (Istio, Linkerd).
- Secrets management (Vault, AWS Secrets Manager).
- Security hardening of containers and infrastructure.
- Chaos engineering exposure.
- Knowledge of networking (DNS, firewalls, VPNs).
Soft Skills :
- Strong problem-solving attitude; calm under pressure.
- Good documentation and communication skills.
- Ownership mindset with a drive to automate everything.
- Collaborative and proactive with cross-functional teams.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1531304
Interview Questions for you
View All