Posted on: 24/01/2026
Description :
About the Role :
Were looking for a skilled Site Reliability Engineer (SRE) to join our platform engineering team supporting a critical enterprise grade .NET application hosted on Azure, with upcoming cloud transformation initiatives targeting Google Cloud Platform (GCP).
This role demands strong operational discipline, infrastructure-as-code expertise, and a solid understanding of both application and database ecosystems.
You will work closely with development and infrastructure teams to improve system reliability, performance, scalability, and automation.
You will create a bridge between development and operations by applying a software engineering mindset to system administration topics.
Your time will split between operations/on-call duties and developing systems and software to continuously improve system reliability and performance.
Key Responsibilities :
- Manage, monitor, and scale .NET-based applications hosted on Azure.
- Write and manage infrastructure as code using Terraform.
- Troubleshoot and optimize Microsoft SQL Server (MSSQL) databases.
- Ensure high availability and reliability of application services across environments.
- Automate build/deploy/monitoring pipelines (CI/CD, alerting, healing).
- Participate in on-call rotation and own incident management/resolution workflows.
- Contribute to GCP migration planning and foundational infrastructure setup.
- Identify and implement SLOs, SLIs, and error budgets.
- Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
- Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and dont get paged when it doesnt).
- Use practices from DevOps and GitOps to improve automation and processes to make self service possible.
- Safeguarding reliability.
- Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
Requirements :
Must-Have Skills :
- 4 - 6 years in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.
- Experience with Azure cloud services and resource management.
- Hands-on with Terraform (HCL) for infrastructure provisioning.
- Proficient in .NET application support and performance tuning.
- Strong working knowledge of Microsoft SQL Server (MSSQL) or MySQL.
- Solid understanding of monitoring tools, logging frameworks, and observability practices.
- Experience with CI/CD pipelines (Azure DevOps preferred).
Nice-to-Have Skills :
- Exposure to Google Cloud Platform (GCP) and multi-cloud architecture.
- Experience working on Windows-based server environments.
- Familiarity with containerization (Docker, Kubernetes).
- Prior participation in cloud migration or modernization projects.
Minimum Qualifications :
- BS in Computer Science, Information Technology, Business / Management Information Systems or related field or equivalent experience.
- Typically minimum of 4 years relevant experience
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1605856