Posted on: 17/04/2026
Description:
We are seeking a Multi-Cloud Tech Lead (L3) with strong hands-on operations experience, deep DevOps and Kubernetes expertise, and proven ability to work directly from customer premises.
This is a customer-facing role responsible for ensuring platform stability, high availability, performance, and security across multi-cloud production environments. The role serves as the primary onsite technical authority, handling L3 escalations, leading root cause analysis, and driving operational excellence.
Key Responsibilities:
- Act as the L3 escalation point for complex production issues across multi-cloud environments.
- Work closely with customer engineering, security, and business teams from the onsite location.
- Perform deep troubleshooting and root cause analysis (RCA) across cloud, Kubernetes, networking, and application layers.
- Operate, manage, and stabilize production Kubernetes platforms (GKE / AKS preferred).
- Lead incident management, problem management, and preventive action initiatives.
- Support and enhance CI/CD pipelines, deployment reliability, and automation.
- Implement and govern change management, release management, and operational best practices.
- Improve monitoring, alerting, and observability to reduce MTTR.
- Ensure compliance with security, regulatory, and audit requirements in regulated environments.
- Mentor L1/L2 engineers and act as the technical representative of the operations team to the customer.
- Drive cost optimization and operational efficiency initiatives.
Mandatory Skills & Experience:
- Multi-Cloud Operations: Hands-on experience with at least two public cloud platforms (AWS, GCP, and Azure preferred).
- Kubernetes: Strong hands-on experience managing production-grade clusters.
- DevOps: CI/CD pipelines, automation, and Infrastructure as Code (Terraform preferred).
- Troubleshooting: Advanced expertise in Linux, networking, performance tuning, and distributed systems.
- Observability: Experience with monitoring, logging, and alerting tools (Dynatrace / ELK preferred).
- Customer-Facing Experience: Proven ability to work onsite and engage directly with customer stakeholders.
- SRE Exposure: Reliability engineering, availability, SLIs/SLOs, and resilience practices.
- Cost & Efficiency: Experience with cloud cost optimization and operational improvements.
Posted by
Channaraju G D
Talent Acquisition Lead at BOOTLABS TECHNOLOGIES PRIVATE LIMITED
Posted in
DevOps / SRE
Functional Area
Cloud Computing
Job Code
1629130