Posted on: 30/07/2025
Job Description :
As a Principal/Chief Site Reliability Engineer, you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems.
Youll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure.
Responsibilities :
- Design and implement scalable, highly available infrastructure and automation solutions.
Key Requirements :
- Experience: 15+ years in technology, with at least 5+ years in Site Reliability Engineering.
- Development Background: Strong hands-on experience in C/C++, Java, Go, or Python.
- Proven experience as a hands-on Individual Contributor (not a managerial role).
- Proficiency in scripting, system programming, and multi-platform architecture.
- Deep knowledge of:
a. Linux/Unix OS fundamentals.
b. Networking (DNS, TCP/IP, etc.
c. Cloud platforms (preferably AWS).
d. Observability and Monitoring Tools.
e. CI/CD and Infrastructure as Code.
- Strong exposure to SRE concepts: reliability, automation, on-call best practices, etc.
- System design, performance tuning, and troubleshooting large-scale systems.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1522073
Interview Questions for you
View All