Posted on: 04/11/2025
Position Overview :
We are seeking a highly skilled and experienced Azure DevOps Engineer to join our dynamic team. The ideal candidate will be responsible for designing, implementing, managing, and optimizing our mission-critical, high-throughput Azure infrastructure. You will play a pivotal role in ensuring the reliability, scalability, and performance of applications. This role demands deep technical expertise in Azure, automation, and large-scale distributed systems.
Key Responsibilities :
- Implement, manage, and optimize CI/CD pipelines within Azure DevOps (or an equivalent such as GitHub Actions) to enable rapid, reliable deployments following GitOps best practices.
- Deploy, manage, and automate the scaling of containerized applications on Azure Kubernetes Service (AKS), leveraging HPA, VPA, and cluster autoscaler.
- Architect, deploy, operate, and fine-tune high-throughput messaging systems, specifically Kafka or equivalent event streaming platforms.
- Provision, configure, tune, and maintain distributed databases (e.g., Azure Cosmos DB, Cassandra, MongoDB), including implementing robust connection pooling strategies.
- Establish and manage comprehensive observability platforms (e.g., Datadog, New Relic, ELK Stack, Prometheus & Grafana) to ensure end-to-end monitoring, logging, and alerting.
- Define, track, and report on Service Level Objectives (SLOs) and Service Level Indicators (SLIs); configure actionable alerts.
- Lead troubleshooting efforts during incidents, conduct thorough post-mortems, and drive continuous improvement initiatives to enhance system reliability and efficiency.
- Develop automation scripts using Python, Bash, or PowerShell to streamline operational tasks and infrastructure management.
Required Qualifications :
- Experience : Strictly 4 to 9 years of hands-on experience as a DevOps Engineer with a strong focus on the Azure cloud platform.
- Education : Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical discipline from a Tier 1 or Tier 2 institution.
- Prior Environment : Demonstrable experience working in product-based companies or technology startups renowned for building and managing high-volume, low-latency, mission-critical systems. Experience exclusively from service-based consulting companies will not be considered.
- High-Volume Infrastructure Expertise : Proven ability to design, operate, and scale large-scale infrastructure handling significant throughput and demanding low latency.
- Messaging/Queuing Systems : Hands-on experience deploying and managing production-grade Kafka clusters or equivalent high-throughput messaging systems.
- Container Orchestration : Expertise in deploying, managing, and auto-scaling applications on Kubernetes (AKS preferred).
- Observability : Proven experience setting up and managing comprehensive monitoring and logging solutions using tools like Datadog, New Relic, ELK, Prometheus, Grafana, or equivalents.
- Distributed Databases : Solid experience provisioning, tuning, and maintaining distributed SQL databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., Cosmos DB, Cassandra, MongoDB), including connection pool management.
- Automation & IaC : Strong proficiency with Infrastructure as Code tools (Terraform highly preferred) and CI/CD pipeline automation (Azure DevOps preferred).
- Scripting : Excellent scripting skills in Python, Bash, or PowerShell.
Preferred Qualifications :
- Experience with Azure Event Hubs or similar event-driven architecture components.
- Familiarity with serverless technologies like Azure Functions.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1569474