Posted on: 06/10/2025
Experience Required : 3- 6 years.
Location: Gurgaon.
Department: Product and Engineering.
Working Days: Alternate Saturdays Working wfh (1st and 3rd).
Key Responsibilities :
- Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
- Build and manage Kubernetes clusters (EKS, self-managed) to ensure reliable deployment and scaling of microservices.
- Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure provisioning.
- Containerize applications and optimize Docker images for performance and security.
- Ensure CI/CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure deployments.
- Drive SRE principles including monitoring, alerting, SLIs/SLOs, and incident response.
- Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
- Automate routine tasks with scripting languages (Python, Bash, etc.).
- Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
Collaborate closely with development teams to enable DevSecOps best practices.
Participate in on-call rotations, handle outages with calm, and conduct postmortems.
Must-Have Technical Skills :
- Kubernetes (EKS, Helm, Operators). Docker & Docker Compose.
- Terraform (modular, state management, remote backends). AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS/EKS).
- Linux system administration.
- Database tuning based on hardware config.. CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions).
- Logging & monitoring tools: ELK, Prometheus, Grafana, CloudWatch.
- Site Reliability Engineering practices.
- Load balancing, autoscaling, and HA architectures.
Good-To-Have :
- GCP or Azure exposure. Security hardening of containers and infrastructure.
- Chaos engineering exposure. Knowledge of networking (DNS, firewalls, VPNs).
Soft Skills :
- Strong problem-solving attitude; calm under pressure.
- Good documentation and communication skills.
- Ownership mindset with a drive to automate everything.
- Collaborative and proactive with cross-functional teams. Show more Show less
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1556593
Interview Questions for you
View All