Posted on: 11/09/2025
About the Role :
We are looking for highly skilled Senior Site Reliability Engineers (SREs) who will be hands-on contributors in building and maintaining reliable, scalable, and secure infrastructure. This role demands strong technical depth, problem-solving perseverance, and a proactive mindset to ensure the success of our products and platforms.
Key Responsibilities :
- Drive reliability, scalability, and performance improvements across systems and infrastructure.
- Design, implement, and maintain CI/CD pipelines, monitoring systems, and automation frameworks.
- Own infrastructure build-outs, ensuring high availability and resilience.
- Troubleshoot and resolve complex issues spanning infrastructure and applications.
- Collaborate with cross-functional engineering teams to improve reliability and operational excellence.
- Champion Infrastructure-as-Code (IaC) practices and ensure robust automation.
- Take full ownership of product reliability and success from an SRE perspective.
Key Attributes & Skills :
- Proven experience in Site Reliability Engineering or DevOps.
- Strong hands-on skills in CI/CD, monitoring, automation, Infrastructure as Code (Terraform/Ansible, etc.).
- Deep expertise in debugging and troubleshooting at both infrastructure and application levels.
- Self-starter with perseverance able to push through complex technical challenges.
- Ability to figure things out independently with minimal guidance.
- Strong internal communication and collaboration skills (not client-facing but must work well with teams).
- Ownership-driven, with a strong sense of accountability for product success.
- Bonus : Experience in customer/internal stakeholder communication.
Qualifications :
- Bachelors/Masters degree in Computer Science, Engineering, or equivalent practical experience.
- 5+ years of hands-on experience in SRE, DevOps, or Infrastructure Engineering.
- Familiarity with cloud platforms (AWS, GCP, Azure), containerization (Docker, Kubernetes), and observability tools (Prometheus, Grafana, ELK, etc.).
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1544936
Interview Questions for you
View All