Posted on: 01/04/2026
Description :
Role Overview :
We are looking for a highly skilled Azure Site Reliability Engineer (SRE) to join our Cloud Engineering team.
The ideal candidate brings strong expertise in Terraform and Python scripting, with a proven ability to design, automate, and operate highly available, scalable, and secure cloud infrastructure on Microsoft Azure.
This role requires a proactive engineer who is passionate about automation, reliability, performance optimization, and operational excellence in cloud environments.
Key Responsibilities :
Cloud Infrastructure & Automation :
- Architect, deploy, and manage scalable, secure, and highly available infrastructure on Microsoft Azure.
- Develop, maintain, and enhance Infrastructure as Code (IaC) using Terraform.
- Build automation frameworks and operational tooling using Python and Shell scripting.
- Drive infrastructure standardization and reusable deployment patterns across environments.
CI/CD & DevOps Enablement :
- Design and implement CI/CD pipelines for both infrastructure and application deployments.
- Promote DevOps best practices, including automated testing, validation, and release management.
- Enable seamless environment provisioning and configuration management.
Reliability & Monitoring :
- Implement monitoring, ing, and observability solutions using Azure-native tools.
- Continuously monitor system performance, availability, and reliability.
- Conduct root cause analysis and implement preventive measures to reduce incidents.
Performance & Cost Optimization :
- Optimize cloud resource utilization to improve performance and scalability.
- Drive cost optimization initiatives across Azure environments.
- Automate operational workflows to reduce manual effort and eliminate toil.
Collaboration & Engineering Excellence :
- Partner with development teams to improve application resilience, scalability, and fault tolerance.
- Contribute to reliability engineering practices, including incident management, postmortems, and capacity planning.
- Advocate for reliability-first design and operational best practices.
Required Skills & Qualifications :
- Strong hands-on experience with Microsoft Azure cloud services.
- Proven expertise in Terraform for Infrastructure as Code.
- Strong scripting skills in Python (Shell scripting knowledge preferred).
- Experience implementing CI/CD pipelines and DevOps workflows.
- Solid understanding of cloud networking, security, and monitoring concepts.
- Experience in production support and reliability engineering practices.
Skills :
- Azure, SRE, Monitoring tool, Terraform
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1625180