Posted on: 07/12/2025
About the job :
Join us in bringing joy to customer experience.
Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide.
Living our values everyday results in our team-first culture and enables us to innovate, grow, and thrive while enjoying the journey together.
We celebrate diversity and foster an inclusive environment, empowering our employees to be their authentic selves.
We are seeking a Site Reliability Engineer (SRE) to join our team and help build and maintain highly reliable, scalable systems.
This role combines software engineering and operations expertise to ensure our services meet and exceed ambitious reliability targets while enabling rapid development and deployment.
This position requires approximately 50% software development and 50% operational work, focusing on automation, monitoring, and system reliability rather than manual operations.
The team works collaboratively with our platform, application, and database teams to provide a reliable and available service.
Key Responsibilities :
Observability & Monitoring :
- Dashboards & Metrics : Design and implement comprehensive dashboards covering OS/platform-level and application-level monitoring, broken into primary (RED) and secondary indicators (USE).
- Availability & Reliability : Establish and maintain SLIs, SLOs, and error budgets for the service.
- Performance Monitoring : Build alerting systems and performance monitoring to proactively identify and resolve issues before they impact users.
- Incident Response : Participate in on-call rotations, lead incident response efforts (including post-mortem analysis and remediation), maintain on-call routing, and assign application-level problems to engineering teams.
Infrastructure Automation & Deployment :
- CI/CD Pipeline Management : Build and optimize CI/CD pipelines for speed and resilience.
- Infrastructure as Code : Develop and maintain infrastructure using tools like Terraform, Ansible, or similar.
- Configuration Management : Automate system configuration and ensure consistency across environments. Implement and recommend best practices for configuration control.
Security & Compliance :
- Security Automation : Ensure security scanning systems are in place and review escalated vulnerabilities.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1586169
Interview Questions for you
View All