Posted on: 01/04/2026
Job Description :
Location : Work From Office
Experience : 5 to 8 Years
Role Overview :
We are looking for a DevOps Lead to drive the design, scalability, reliability, and security of our infrastructure across cloud environments. This role involves leading the DevOps function, defining best practices, and ensuring high availability for large-scale distributed systems.
You will work closely with engineering, data, and product teams to build a robust, automated, and cost-efficient infrastructure platform.
Key Responsibilities :
- Lead the design and management of cloud infrastructure across AWS and GCP.
- Define and implement DevOps, SRE, and platform engineering best practices.
- Architect and manage high-scale Kubernetes clusters and microservices infrastructure.
- Build and optimize CI/CD pipelines for reliability, speed, and security.
- Drive Infrastructure as Code (IaC) adoption using Terraform or similar tools.
- Ensure high availability and fault tolerance, and define disaster recovery strategies.
- Own the observability stack (monitoring, logging, tracing) and incident management processes.
- Define and enforce security, compliance, and DevSecOps practices.
- Optimize cloud costs (FinOps) and resource utilization at scale.
- Lead incident response, RCA processes, and continuous reliability improvements.
- Collaborate with engineering teams to improve deployment velocity and system resilience.
- Mentor and manage a team of DevOps engineers.
Required Skills :
- 5 to 8 years of experience in DevOps / SRE / Platform Engineering, with leadership experience.
- Strong hands-on expertise in AWS and/or GCP cloud platforms.
- Deep experience with Kubernetes (EKS/GKE) and containerization (Docker).
- Strong experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.).
- Expertise in Infrastructure as Code (Terraform preferred).
- Strong understanding of distributed systems, networking, and Linux internals.
- Experience with monitoring & observability tools (Prometheus, Grafana, ELK, OpenSearch, Datadog, etc.).
- Proven experience handling high-scale systems (high TPS / large user base).
Good to Have :
- Experience with multi-cloud or hybrid cloud architectures.
- Exposure to service mesh (Istio/Linkerd) and advanced Kubernetes concepts.
- Experience with security frameworks (CIS, IAM best practices, zero trust).
- Familiarity with OpenTelemetry and distributed tracing.
- Experience with data platforms (Kafka, BigQuery, Redshift, etc.).
Leadership Expectations :
- Ability to drive architecture decisions and long-term platform strategy.
- Strong focus on automation, reliability, and developer productivity.
- Experience in team mentoring, hiring, and performance management.
- Ownership mindset with the ability to work in high-growth, fast-paced environments.
Posted in : DevOps / SRE
Functional Area : DevOps / Cloud
Job Code : 1625182