HamburgerMenu
hirist

Job Description

Job Title : Datadog developer.


Location : PAN INDIA.

Experience : 6 To 10 Years.

Job Type : Contract to hire.

Notice Period : Immediate joiners.

Mandatory Skills : Datadog, Kubernetes, Docker.


Job description.

Must have skill :

- Datadog.

- Kubernetes.

- Docker.


You will be responsible for designing, implementing, and managing Datadog solution, ensuring seamless integration with Kubernetes, cloud providers, and CI/CD tools to achieve traceability and observability.


Key Responsibilities :.

- Creating Customer centric Use cases and providing consulting for custom metrics, alerts, log monitoring, analysis and visualizations in Datadog.

- Set up and configure Datadog for monitoring infrastructure, applications, and logs via automation.

- Develop a consolidated view for transaction health and include synthetic health checks for APIs.

- Analyze and correlate data across various services for troubleshooting and optimization.

- Implement AIOps for event management by centralizing and correlating events, enabling anomaly detection, and predictive incident notifications.

- Enhance proactive issue resolution and minimize downtime using advanced AI/ML tools, including LLMs for event clustering.


Technical Skills :

- Minimum 4 years of hands-on experience with Datadog, including integration with Kubernetes, cloud providers, and CI/CD tools, with an overall experience of 6+ years


- Proficiency in Datadogs Service Map, Service Catalog, and other observability tools.

- Strong understanding of cloud infrastructure (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).

- Experience with CI/CD pipelines and tools such as Jenkins, Git.

- Knowledge of application development frameworks and languages (e., Java, .NET, Node.js).

- Familiarity with scripting and automation (e., Python, Bash).

- Proven track record of setting up and managing data source integrations.


Preferred Qualifications :

- Datadog Foundation certification.

- Experience with other monitoring tools.

- Site Reliability Engineering (SRE) experience, focusing on high availability, performance, and scalability of systems.


info-icon

Did you find something suspicious?