Posted on: 04/08/2025
DevOps/SRE Engineering Leader Job Description
Role Summary :
We are looking for a highly accomplished DevOps/SRE Engineering Leader to oversee multi-cloud infrastructure, manage critical Kubernetes environments, lead a high-performing engineering team, and ensure enterprise-grade reliability, automation, and security across global cloud platforms. You will be responsible for strategic DevOps and SRE initiatives, platform stability, security compliance, and FinOps optimization in a dynamic, fast-paced technology environment.
Key Responsibilities :
- Manage critical SaaS infrastructure with 99.99% uptime requirements across global regions.
- Design and optimize CI/CD pipelines using Jenkins, GitLab CI, Azure DevOps, Helm, and GitOps.
- Implement Infrastructure as Code using Terraform, Ansible, and Python.
- Oversee container orchestration with Kubernetes (AKS/EKS), Docker, and Rancher.
- Define and enforce cloud security, compliance (SOC 2, PCI DSS, ISO 27001), and governance standards.
- Drive cloud cost optimization through FinOps best practices and tooling (e.g., Cast AI, Azure Cost Management).
- Lead observability, monitoring, and incident response using tools such as Prometheus, Grafana, Datadog, ELK Stack, and Azure Monitor.
- Manage stakeholder communication, project delivery, and resource planning aligned with business OKRs.
- Mentor and scale distributed engineering teams, fostering a culture of technical excellence and accountability.
- Deliver high-scale platform modernization, cloud migration, and automation initiatives.
Required Skills and Technologies :
- Cloud Platforms : AWS, Azure, Oracle Cloud, GCP.
- CI/CD & DevOps : Jenkins, GitLab, Azure DevOps, Helm, GitOps.
- IaC Tools : Terraform, Ansible, Puppet, Shell scripting, Python.
- Containers & Orchestration : Docker, Kubernetes (AKS/EKS), Rancher, OpenShift.
- Monitoring & Observability : Grafana, Prometheus, ELK, Datadog, New Relic.
- Security & Compliance : SOC 2, PCI DSS, ISO 27001, IAM, PIM.
- Network & Infrastructure : Cisco, FortiGate, VMware, KVM, HPE, Veritas NetBackup.
- Project & Team Management : Agile, Scrum, Jira, Confluence, ITSM, ITIL.
Qualifications :
- Bachelor's or Master's degree in Computer Science, IT, Telecommunications, or a related field.
Preferred Certifications :
- Certified Kubernetes Administrator (CKA).
- AWS/Azure Certified Solutions Architect.
- ISC2 Certified in Cybersecurity (CC).
- ITIL, Cisco CCNP, or Specialist-level certifications.
Preferred Experience :
- 10+ years of experience in SRE/DevOps with at least 5 years in technical leadership roles.
- Proven success in large-scale cloud architecture and production operations.
- Experience working with global teams and international client stakeholders.
- Hands-on expertise in cloud migration, application modernization, and platform security.
Soft Skills :
- Strong communication & stakeholder management.
- Analytical problem-solving.
- Resilience and adaptability in dynamic environments.
- Commitment to continuous learning and innovation.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1524221