HamburgerMenu
hirist

Lead Devops Engineer - Cloud Infrastructure

Scaling Theory Technologies Pvt Ltd
8 - 15 Years
Multiple Locations

Posted on: 03/03/2026

Job Description

Description :



Role Overview :

We are seeking a highly experienced Lead DevOps Engineer to define and drive DevOps strategy, cloud infrastructure, and production reliability across enterprise-scale systems. This is a hands-on leadership role with direct client interaction, requiring deep expertise in cloud platforms, automation, CI/CD, container orchestration, and cost optimization.

The ideal candidate embodies an automation-first mindset someone who consistently transforms manual processes into scalable, reliable, and repeatable systems. You will lead DevOps initiatives across multi-cloud and hybrid environments, ensuring high availability, security, performance, and cost efficiency.

Key Responsibilities :



DevOps Leadership & Team Management :

- Lead, mentor, and manage a team of 23 DevOps engineers (Infrastructure and Application DevOps).

- Establish and enforce DevOps best practices and an automation-driven culture.

- Conduct code reviews, streamline workflows, and promote knowledge sharing.

- Drive operational excellence across all DevOps functions.

Cloud & Infrastructure Ownership :

- Own and manage end-to-end infrastructure across Microsoft Azure (primary), AWS, GCP, and hybrid/on-prem environments.

- Architect scalable, resilient, and highly available cloud systems.

- Demonstrate deep expertise in Azure services including :

1. AKS

2. Azure DevOps & Azure Pipelines

3. Networking

4. Storage

5. IAM

6. Monitoring

Containerization & CI/CD :

- Design and manage containerized environments using Docker and Kubernetes.

- Build and optimize CI/CD pipelines using tools such as :

1. Azure Pipelines

2. GitHub Actions

3. GitLab CI

4. Jenkins

5. ArgoCD

- Implement Infrastructure as Code (IaC) using Terraform with modular, reusable components.

- Drive end-to-end automation across build, test, deploy, and release cycles.

4. Production Operations & Reliability :

- Own uptime, scalability, reliability, and performance of production systems.

- Implement observability and monitoring using tools like Prometheus, Grafana, ELK, Datadog, or New Relic.

- Lead incident management, root cause analysis (RCA), and preventive improvements.

- Conduct capacity planning and infrastructure right-sizing.

5. Cloud Cost Optimization (FinOps) :

- Monitor and optimize cloud spend across Azure, AWS, and GCP.

- Prepare and present weekly, monthly, and quarterly cost reports to leadership and clients.

- Build forecasts, budgets, and cost projections.

- Recommend cost-efficient architectural improvements aligned with FinOps principles.

6. Security, Governance & Compliance :

- Implement cloud security best practices and automated policy enforcement.

- Ensure least-privilege access control and secure CI/CD pipelines.

- Partner with security teams on audits, vulnerability management, and system hardening.

- Maintain compliance readiness across environments.

7. Client Engagement :

- Serve as the primary technical point of contact for clients.

- Present architecture designs, production performance metrics, and cost insights.

- Translate complex technical topics into clear, business-aligned communication.

Required Skills & Experience :



Technical Requirements :



- 8 to 10 years of DevOps experience, including at least 2+ years in a leadership role.

- Strong hands-on expertise in :

1. Azure, AKS, Azure DevOps, Azure Pipelines

2. Docker & Kubernetes

3. Terraform (Infrastructure as Code)

4. CI/CD pipeline architecture and automation

- Working knowledge of AWS and GCP.

- Experience managing hybrid or on-prem infrastructure.

- Strong understanding of networking (VPC/VNet, routing, load balancers), monitoring, HA/DR, and scaling strategies.

Automation Mindset :



- Demonstrated automation-first approach to infrastructure and operations.

- Proven track record of eliminating manual processes via scripting, pipelines, and IaC.

- Proficiency in scripting languages such as Bash, Python, or PowerShell.

Operational Excellence :



- Experience managing high-traffic, mission-critical production systems.

- Strong troubleshooting, incident management, and performance optimization capabilities.

- Expertise in capacity planning and cloud cost optimization.

Soft Skills :



- Excellent written and verbal communication skills.

- Strong client-facing confidence and presentation ability.

- Demonstrated leadership, ownership, and mentoring skills.

Good-to-Have Skills :



- GitOps tools (ArgoCD, FluxCD)

- Service Mesh technologies (Istio, Linkerd)

- Serverless architectures (Azure Functions, AWS Lambda)

- FinOps or cloud governance expertise

- Experience supporting AI, Big Data, or advanced workloads

Why Join Us :



- Lead DevOps initiatives for large-scale enterprise cloud and hybrid systems.

- High-visibility role with direct exposure to clients and leadership.

- Clear growth path toward Cloud Architect or DevOps Manager roles.

- Work with modern, automation-driven cloud technologies at scale.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in