HamburgerMenu
hirist

Sails Software - Site Reliability Engineer - AWS

SAILS SOFTWARE SOLUTIONS PRIVATE LIMITED
Vishakhapatnam/Vizag
5 - 7 Years

Posted on: 14/01/2026

Job Description

Description :


Title : SRE-AWS


Location : Vizag (On Site)


Experience : 7+ years

We are looking for a highly experienced Senior SRE with strong expertise in AWS to help design, operate, and scale the infrastructure powering our product platforms.

This is a mission-critical role in a fast-moving product development environment, where system reliability, automation, and performance are core business drivers.

Key Responsibilities :


Reliability & Operations :


- Own reliability, availability, and performance of large-scale production systems.

- Establish SLOs, SLAs, and error budgets for mission-critical services.

- Lead incident response, root cause analysis, and continuous improvement initiatives.

- Design fault-tolerant architectures and disaster recovery strategies.

Cloud & Infrastructure Engineering :

- Architect, deploy, and manage infrastructure on AWS using IaC (Terraform / CloudFormation).

- Optimize cloud costs while maintaining performance and reliability.

- Implement multi-region, highly available architectures.

- Manage container platforms (Docker, Kubernetes, EKS).

Automation & DevOps :


- Build automation pipelines for infrastructure provisioning, deployment, and scaling.

- Improve CI/CD pipelines and release engineering processes.

- Develop tools and scripts to reduce operational toil.

Observability & Performance :


- Implement comprehensive monitoring, logging, and alerting systems.

- Drive performance tuning and capacity planning.

- Lead chaos engineering and resilience testing practices.

Required Skills & Experience :


- 7+ years in Site Reliability Engineering / DevOps / Infrastructure roles.

- Deep hands-on experience with AWS services (EC2, EKS, RDS, S3, Lambda, VPC, IAM, etc.

- Expertise in infrastructure as code : Terraform, CloudFormation.

- Strong experience with Linux systems, networking, and distributed systems.

- Experience with Kubernetes, container orchestration, and microservices environments.

- Strong scripting skills (Python, Bash, Go).

- Knowledge of security best practices and compliance requirements


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in