BootLabs - Cloud Technical Lead - AWS/Azure/GCP

BOOTLABS TECHNOLOGIES PRIVATE LIMITED
9 - 17 Years
Multiple Locations

Posted on: 17/04/2026

Job Description

We are seeking a Multi-Cloud Tech Lead (L3) with strong hands-on operations experience, deep DevOps and Kubernetes expertise, and proven ability to work directly from customer premises.

This is a customer-facing role responsible for ensuring platform stability, high availability, performance, and security across multi-cloud production environments. The role serves as the primary onsite technical authority, handling L3 escalations, leading root cause analysis, and driving operational excellence.

Key Responsibilities :

- Act as the L3 escalation point for complex production issues across multi-cloud environments.

- Work closely with customer engineering, security, and business teams from the onsite location.

- Perform deep troubleshooting and root cause analysis (RCA) across cloud, Kubernetes, networking, and application layers.

- Operate, manage, and stabilize production Kubernetes platforms (GKE / AKS preferred).

- Lead incident management, problem management, and preventive action initiatives.

- Support and enhance CI/CD pipelines, deployment reliability, and automation.

- Implement and govern change management, release management, and operational best practices.

- Improve monitoring, alerting, and observability to reduce MTTR.

- Ensure compliance with security, regulatory, and audit requirements in regulated environments.

- Mentor L1/L2 engineers and act as the technical representative of the operations team to the customer.

- Drive cost optimization and operational efficiency initiatives.

Mandatory Skills & Experience :

- Multi-Cloud Operations : Hands-on experience with at least two public cloud platforms (AWS, GCP, and Azure preferred).

- Kubernetes : Strong hands-on experience managing production-grade clusters.

- DevOps : CI/CD pipelines, automation, and Infrastructure as Code (Terraform preferred).

- Troubleshooting : Advanced expertise in Linux, networking, performance tuning, and distributed systems.

- Observability : Experience with monitoring, logging, and alerting tools (Dynatrace / ELK preferred).

- Customer-Facing Experience : Proven ability to work onsite and engage directly with customer stakeholders.

- SRE Exposure : Reliability engineering, availability, SLIs/SLOs, and resilience practices.

- Cost & Efficiency : Experience with cloud cost optimization and operational improvements.
