Job Description

Description :

We are seeking an experienced and highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of enterprise-grade systems. The ideal candidate will have strong hands-on experience with observability and monitoring tools and the ability to lead and manage technical teams.

This role involves working closely with development, operations, and data engineering teams to build, monitor, and maintain high-performance data pipelines supporting both real-time and batch processing workloads. The candidate will play a critical role in improving system reliability, operational efficiency, and incident response processes.

Key Responsibilities :

- Design, implement, and manage monitoring, alerting, and observability solutions to ensure system health and availability.

- Build and maintain reliable, scalable, and high-performing systems supporting enterprise applications.

- Monitor production environments and proactively identify performance bottlenecks and reliability risks.

- Lead incident management activities, including troubleshooting, root cause analysis, and post-incident reviews.

- Collaborate with cross-functional teams to improve system resilience, fault tolerance, and automation.

- Support and optimize both real-time and batch data pipelines from a reliability and performance standpoint.

- Mentor and guide team members while remaining actively involved in hands-on technical work.

- Drive continuous improvement initiatives focused on system stability, monitoring maturity, and operational excellence.

Key Skills & Requirements :

- 7+ years of experience as a Site Reliability Engineer (SRE) or in a similar reliability/operations role.

- In-depth experience in at least two of the following tools :

1. Grafana

2. Datadog

3. Dynatrace

4. Splunk

- Strong understanding of system monitoring, observability, alerting, and performance optimization.

- Proven experience in team management and technical leadership, with the ability to guide and mentor engineers.

- Strong hands-on experience in production environments and reliability engineering best practices.

- Experience working in collaborative, fast-paced enterprise environments.

Educational Qualifications :

- Bachelor's degree in Computer Science, Information Technology, or a related discipline.

