Posted on: 08/01/2026
About the Role :
As we continue to scale and expand our services, we are seeking a skilled and experienced individual who possesses a strong background in Azure Cloud, Incident Management, SRE Principles and Automation.
Responsibilities :
Azure Cloud Expertise :
- Solid knowledge and hands-on experience with Microsoft Azure services and solutions.
- Ability to design, deploy, and manage cloud-based architectures on Azure.
- Proficiency in using Azure tools for monitoring, automation, and scaling.
Incident Management :
- Demonstrated expertise in incident identification, response, resolution, and post-incident analysis.
- Experience with incident escalation procedures, ensuring timely and effective responses.
- Strong analytical and problem-solving skills to address incidents swiftly and efficiently.
- Skilled in conducting thorough incident root cause analysis.
SRE Principles :
- Comprehensive understanding of Site Reliability Engineering (SRE) methodologies.
- Implementation of SRE best practices to enhance system stability, performance, and observability.
- Proactive monitoring and alerting strategies to detect and mitigate potential issues.
- Continuous improvement mindset, contributing to the development and enhancement of SRE processes.
Automation :
- Expertise in automating infrastructure provisioning, configuration, and management.
- Proficiency in scripting languages (e.g., Python, PowerShell) to build automation solutions.
- Experience with Infrastructure as Code (IaC) tools like Terraform is a plus.
Qualifications :
- Additionally, the ideal candidate should possess excellent communication and collaboration skills, as they will be working closely with cross-functional teams, including developers, operations, and product managers (if required).
Required Skills : Azure Cloud, Incident Management, SRE Principles, Automation, Communication, Collaboration.
Preferred Skills : Experience with Infrastructure as Code (IaC) tools like Terraform.
Equal Opportunity Statement : We are committed to diversity and inclusivity.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1598477