HamburgerMenu
hirist

Job Description

Job Description :


Responsibilities :


- Lead initiatives to identify and eliminate manual, repetitive tasks through automation and tooling.

- Develop self-healing infrastructure solutions and drive continuous operational efficiency.

- Lead efforts to build resilient systems and proactively identify potential points of failure across the stack.

- Design and implement reliability-focused automation and tooling to ensure consistent system performance and uptime.


- Support post-release validations and operational readiness assessments to ensure smooth rollouts.

- Occasional weekend support may be required (e.g., during major releases or critical changes).

- Design, implement, and manage cloud-native infrastructure using Terraform and other IaC tools.

- Ensure infrastructure follows principles of scalability, fault tolerance, and security.

- Design and implement robust monitoring and alerting solutions using Elastic Stack, OpenTelemetry (OTEL), and similar tools.

- Define and manage SLIs/SLOs, and partner with development teams to ensure service reliability.

- Partner with engineering teams to create & improve CI/CD pipelines and deployment processes.

- Provide technical leadership and recommendations to improve system architecture, release velocity, and developer productivity.

Requirements :


- Good experience on OS Linux, Cloud AWS cloud

- Strong in Terraform and Ansible infrastructure

- Good experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering.

- Strong experience with AWS services and Terraform for IaC.

- Deep understanding of incident response, post-mortem analysis, and reliability engineering principles.

- Proven track record with Elastic Stack, or other observability tools.

- Proficient in scripting (Python, Bash, etc.) and working with Git-based workflows.

- Solid grasp of modern CI/CD tooling and software development lifecycle practices.

Good to have :


- Experience in Azure, Kubernetes, or container orchestration tools.


- Good to have OpenTelemetry


info-icon

Did you find something suspicious?