Posted on: 14/01/2026
Description :
Location : Vizag (On Site)
Experience : 7+ years
We are looking for a highly experienced Senior SRE with strong expertise in AWS to help design, operate, and scale the infrastructure powering our product platforms.
This is a mission-critical role in a fast-moving product development environment, where system reliability, automation, and performance are core business drivers.
Key Responsibilities :
- Own reliability, availability, and performance of large-scale production systems.
- Establish SLOs, SLAs, and error budgets for mission-critical services.
- Lead incident response, root cause analysis, and continuous improvement initiatives.
- Design fault-tolerant architectures and disaster recovery strategies.
Cloud & Infrastructure Engineering :
- Architect, deploy, and manage infrastructure on AWS using IaC (Terraform / CloudFormation).
- Optimize cloud costs while maintaining performance and reliability.
- Implement multi-region, highly available architectures.
- Manage container platforms (Docker, Kubernetes, EKS).
Automation & DevOps :
- Improve CI/CD pipelines and release engineering processes.
- Develop tools and scripts to reduce operational toil.
Observability & Performance :
- Drive performance tuning and capacity planning.
- Lead chaos engineering and resilience testing practices.
Required Skills & Experience :
- Deep hands-on experience with AWS services (EC2, EKS, RDS, S3, Lambda, VPC, IAM, etc.).
- Expertise in infrastructure as code (Terraform, CloudFormation).
- Strong experience with Linux systems, networking, and distributed systems.
- Experience with Kubernetes, container orchestration, and microservices environments.
- Strong scripting skills (Python, Bash, Go).
- Knowledge of security best practices and compliance requirements.
Posted by : Rajanish Pandey, Senior Manager TA at SAILS SOFTWARE SOLUTIONS PRIVATE LIMITED
Last Active : 20 Jan 2026
Posted in : DevOps / SRE
Functional Area : Site Reliability Engineering
Job Code : 1601047