HamburgerMenu
hirist

Monitoring & Observability Engineer - Datadog

FUNIC TECH PRIVATE LIMITED
Remote
4 - 7 Years

Posted on: 31/07/2025

Job Description

Job Title : Monitoring & Observability Engineer Datadog Specialist

Experience : 4+ Years

Location : [Specify Location or Remote]

Job Type : Full-Time


Job Summary :


We are looking for a talented Observability Engineer with hands-on experience in Datadog to enhance our infrastructure and application monitoring capabilities. The ideal candidate will have a strong understanding of performance monitoring, alerting, and observability in cloud-native environments.


Key Responsibilities :


- Design, implement, and maintain observability solutions using Datadog for applications, infrastructure, and cloud services.

- Set up dashboards, monitors, and alerts to proactively detect and resolve system issues.

- Collaborate with DevOps, SRE, and application teams to define SLOs, SLIs, and KPIs for performance monitoring.

- Integrate Datadog with services such as AWS, Kubernetes, CI/CD pipelines, and logging tools.

- Conduct performance tuning and root cause analysis of production incidents.

- Automate observability processes using infrastructure-as-code and scripting (e.g., Terraform, Python).

- Stay up-to-date with the latest features and best practices in Datadog and observability space.


Must-Have Skills :


- 4+ years of experience in monitoring/observability, with 2+ years hands-on experience in Datadog

- Strong experience with Datadog APM, infrastructure monitoring, custom metrics, and dashboards

- Familiarity with cloud platforms like AWS, GCP, or Azure

- Experience monitoring Kubernetes, containers, and microservices

- Good knowledge of log management, tracing, and alert tuning

- Proficient with scripting (Python, Shell) and IaC tools (Terraform preferred)

- Solid understanding of DevOps/SRE practices and incident management


Nice-to-Have Skills :


- Datadog certifications (e.g., Datadog Certified Observability Engineer)

- Experience integrating Datadog with CI/CD tools, ticketing systems, and chatops

- Familiarity with other monitoring tools (e.g., Prometheus, Grafana, New Relic, Splunk)

- Knowledge of performance testing tools (e.g., JMeter, k6)


info-icon

Did you find something suspicious?