Posted on: 08/12/2025
Description :
About the Role :
We are looking for a skilled DevOps / Site Reliability Engineer (SRE) who can take ownership of our cloud infrastructure and help build highly reliable, scalable systems following modern SRE best practices.
Key Responsibilities :
- Design, develop, and maintain Infrastructure as Code (IaC) using Terraform to ensure scalable, consistent, and repeatable environments.
- Build, monitor, and optimize AWS-based infrastructure, including API Gateway, ECS, EC2, S3, RDS, and Event Bridge.
- Create and manage robust CI/CD pipelines to enable smooth deployments, automated testing, and quick rollbacks.
- Implement and manage containerized workloads using Docker, and orchestrate deployments with Kubernetes (basic to intermediate proficiency).
- Apply core SRE principles : define and measure SLIs/SLOs, manage error budgets, set up monitoring/logging/alerting, lead incident response, and automate operational tasks.
- Collaborate closely with development teams to ensure application reliability, scalability, security, and performance.
- Use strong Linux and networking fundamentals for troubleshooting, optimization, and environment stability.
- Continuously analyze system performance and recommend improvements to enhance reliability, performance, and cost efficiency.
Required Skills & Experience :
- 3 - 5 years of hands-on experience in DevOps, SRE, or cloud infrastructure roles.
- Strong proficiency in Terraform for infrastructure provisioning.
- Solid hands-on experience with core AWS services (API Gateway, ECS, EC2, S3, RDS, Event Bridge).
- Experience building and maintaining CI/CD pipelines using tools like Jenkins, GitHub Actions, or GitLab CI/CD.
- Good understanding of Linux administration and networking fundamentals (TCP/IP, DNS, VPNs).
- Practical experience with Docker and Kubernetes (basic to intermediate).
- Working knowledge of SRE principles including SLIs, SLOs, error budgets, reliability engineering, and incident management.
- Strong problem-solving, debugging, and troubleshooting skills.
- Proactive, self-driven, and comfortable working in a fast-paced, dynamic environment.
- Mandatory : Willingness to work onsite in our Bangalore office.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1586056
Interview Questions for you
View All