Posted on: 23/07/2025
Are you ready to help shape the future of cloud contact centers? Were building scalable, reliable, and cutting-edge infrastructure for world-class customer experiences and were looking for a Senior SRE to join our team! If you're passionate about optimizing systems, automating everything, and ensuring top-tier performance, we want to hear from you.
What Youll Do :
- Lead Developer Experience Initiatives : Drive and lead efforts in building a seamless and efficient developer experience, empowering our engineering teams to deliver high-quality software faster.
- Own Internal PaaS Systems : Take ownership of and continuously improve our internal Platform-as-a Service (PaaS) systems, leveraging technologies such as Kubernetes, GitHub Actions, ArgoCD, and Helm.
- Automate Infrastructure & CI/CD : Design, implement, and maintain robust automation for our infrastructure and continuous integration/continuous delivery (CI/CD) pipelines, ensuring rapid and reliable deployments.
- Drive Observability & Reliability : Champion and implement best practices for observability, system reliability, and operational excellence across our cloud infrastructure. This includes monitoring, alerting, logging, and incident response.
- Collaborate Cross-Functionally : Work closely with development, product, and other operations teams to solve complex system challenges, ensuring our infrastructure supports evolving business needs.
What You Bring :
- Experience : 5- 8 years of hands-on experience in a Site Reliability Engineer (SRE) or DevOps role, with a proven track record of managing and scaling production systems.
- CI/CD & IaC Expertise : Deep knowledge and practical experience with CI/CD methodologies and tools, as
well as Infrastructure as Code (IaC) tools like Terraform or Ansible.
- Kubernetes Proficiency : Strong expertise in designing, deploying, and managing applications on Kubernetes clusters.
- Programming Skills : Strong programming skills in Java, Kotlin, or Scala, or excellent scripting abilities in Python or Bash.
- Distributed Systems Understanding : A solid understanding of large-scale distributed systems, their challenges, and best practices for ensuring their reliability and performance.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1518410
Interview Questions for you
View All