Posted on: 30/10/2025
Description :
Senior Site Reliability Engineer (SRE)
Location : Bangalore, India
Experience : 6+ Years
Domain : DevOps/Cloud Infrastructure
About the Role :
We are looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team.
You will be instrumental in ensuring the reliability, scalability, and performance of our global microservices platform.
This role involves deep hands-on work with cloud infrastructure, automation, and applying software engineering principles to operations.
Key Responsibilities :
- Design, implement, and manage highly resilient and scalable infrastructure in a multi-cloud environment (AWS/GCP).
- Develop, deploy, and manage Infrastructure as Code (IaC) using Terraform or CloudFormation.
- Maintain and optimize our container orchestration platform built on Kubernetes and Docker.
- Implement and manage end-to-end CI/CD pipelines using GitLab CI, Jenkins, or ArgoCD for automated and safe releases.
- Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs), and build robust monitoring and alerting using Prometheus, Grafana, and the ELK Stack.
- Lead incident response, troubleshooting, and post-mortem analysis to prevent recurrence and reduce system toil.
- Write quality, well-tested code in Python or Go to automate operational tasks and improve system observability.
Technical Skills Required :
- Expertise in Kubernetes and container management.
- Strong proficiency in a major cloud platform (AWS preferred).
- Mastery of Terraform or equivalent IaC tool.
- Solid experience with monitoring tools like Prometheus, Grafana, and Datadog.
- Scripting proficiency in Python or Go.
- In-depth knowledge of Linux internals, networking (TCP/IP, load balancing), and distributed systems.
Posted in : DevOps / SRE
Functional Area : DevOps / Cloud
Job Code : 1567400