Experience : 7+ years

We are looking for a highly experienced Senior SRE with strong expertise in AWS to help design, operate, and scale the infrastructure powering our product platforms.

This is a mission-critical role in a fast-moving product development environment, where system reliability, automation, and performance are core business drivers.

Key Responsibilities :

Reliability & Operations :

- Own reliability, availability, and performance of large-scale production systems.

- Establish SLOs, SLAs, and error budgets for mission-critical services.

- Lead incident response, root cause analysis, and continuous improvement initiatives.

- Design fault-tolerant architectures and disaster recovery strategies.

Cloud & Infrastructure Engineering :

- Architect, deploy, and manage infrastructure on AWS using IaC (Terraform / CloudFormation).

- Optimize cloud costs while maintaining performance and reliability.

- Implement multi-region, highly available architectures.

- Manage container platforms (Docker, Kubernetes, EKS).

Automation & DevOps :

- Build automation pipelines for infrastructure provisioning, deployment, and scaling.

- Improve CI/CD pipelines and release engineering processes.

- Develop tools and scripts to reduce operational toil.

Observability & Performance :

- Implement comprehensive monitoring, logging, and alerting systems.

- Drive performance tuning and capacity planning.

- Lead chaos engineering and resilience testing practices.

Required Skills & Experience :

- 7+ years in Site Reliability Engineering / DevOps / Infrastructure roles.

- Deep hands-on experience with AWS services (EC2, EKS, RDS, S3, Lambda, VPC, IAM, etc.

- Expertise in infrastructure as code : Terraform, CloudFormation.

- Strong experience with Linux systems, networking, and distributed systems.

- Experience with Kubernetes, container orchestration, and microservices environments.

- Strong scripting skills (Python, Bash, Go).

- Knowledge of security best practices and compliance requirements

Did you find something suspicious?

Similar jobs that you might be interested in

Posted by

Rajanish Pandey

Senior Manager TA at SAILS SOFTWARE SOLUTIONS PRIVATE LIMITED

Last Active: 20 Jan 2026

Job Views:
43

Applications: 35

Recruiter Actions: 6

Posted in

DevOps / SRE

Functional Area

Site Reliability Engineering

Job Code

1601047

Jobs by location

Interview Questions for you

View All

How to Write Leave Application for Urgent Work: Format & Samples (2025)

Top 90+ Machine Learning Interview Questions and Answers

Top 40+ Deep Learning Interview Questions and Answers