Posted on: 07/01/2026
Description :
Key Responsibilities :
- Design and implement end-to-end observability architectures using Grafana and related monitoring tools.
- Develop and maintain advanced Grafana dashboards that provide clear, actionable insights for engineering and operations teams.
- Lead monitoring strategy across metrics, logs, and traces using tools such as Prometheus, Loki, Grafana Cloud, and other observability platforms.
- Collaborate with DevOps, SRE, and application teams to improve system reliability, performance, and incident response
- Automate monitoring and alerting workflows using infrastructure-as-code and scripting.
Required Qualifications :
- 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or infrastructure automation roles.
- 3+ years of hands-on experience with Grafana, including dashboard design and customization.
- Strong proficiency in scripting and automation using Python, Bash, or Go.
- Experience with Kubernetes and cloud platforms (AWS, GCP, Azure).
- Familiarity with OpenTelemetry.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Technical / Solution Architect
Job Code
1597789