HamburgerMenu
hirist

Site Reliability Engineer - Azure Cloud Services

QCENTRIO PRIVATE LIMITED
5 - 8 Years
rupee22-26 LPA
Anywhere in India/Multiple Locations

Posted on: 01/04/2026

Job Description

Description :

Role Overview :

We are looking for a highly skilled Azure Site Reliability Engineer (SRE) to join our Cloud Engineering team.

The ideal candidate brings strong expertise in Terraform and Python scripting, with a proven ability to design, automate, and operate highly available, scalable, and secure cloud infrastructure on Microsoft Azure.

This role requires a proactive engineer who is passionate about automation, reliability, performance optimization, and operational excellence in cloud environments.

Key Responsibilities :

Cloud Infrastructure & Automation :

- Architect, deploy, and manage scalable, secure, and highly available infrastructure on Microsoft Azure.

- Develop, maintain, and enhance Infrastructure as Code (IaC) using Terraform.

- Build automation frameworks and operational tooling using Python and Shell scripting.

- Drive infrastructure standardization and reusable deployment patterns across environments.

CI/CD & DevOps Enablement :

- Design and implement CI/CD pipelines for both infrastructure and application deployments.

- Promote DevOps best practices, including automated testing, validation, and release management.

- Enable seamless environment provisioning and configuration management.

Reliability & Monitoring :

- Implement monitoring, ing, and observability solutions using Azure-native tools.

- Continuously monitor system performance, availability, and reliability.

- Conduct root cause analysis and implement preventive measures to reduce incidents.

Performance & Cost Optimization :

- Optimize cloud resource utilization to improve performance and scalability.

- Drive cost optimization initiatives across Azure environments.

- Automate operational workflows to reduce manual effort and eliminate toil.

Collaboration & Engineering Excellence :

- Partner with development teams to improve application resilience, scalability, and fault tolerance.

- Contribute to reliability engineering practices, including incident management, postmortems, and capacity planning.

- Advocate for reliability-first design and operational best practices.

Required Skills & Qualifications :

- Strong hands-on experience with Microsoft Azure cloud services.

- Proven expertise in Terraform for Infrastructure as Code.

- Strong scripting skills in Python (Shell scripting knowledge preferred).

- Experience implementing CI/CD pipelines and DevOps workflows.

- Solid understanding of cloud networking, security, and monitoring concepts.

- Experience in production support and reliability engineering practices.

Skills :

- Azure, SRE, Monitoring tool, Terraform

info-icon

Did you find something suspicious?

Similar jobs that you might be interested in