Posted on: 18/04/2026
Description :
- Infrastructure at Scale : Design and evolve our cloud-native infrastructure (AWS/Kubernetes), ensuring availability, performance, and cost efficiency across regions and products.
- Platform & Developer Experience : Build internal tools and platforms that help engineers deploy, monitor, and scale their services independently with minimal friction and maximum confidence.
- CI/CD & Release Automation : Architect secure, fast, and scalable CI/CD pipelines across multiple environments using tools like GitHub Actions, and Jenkins.
- Reliability Engineering : Champion observability, SLOs, and incident response practices. Drive a culture of proactive performance monitoring and resilient system design.
- Security & Governance : Integrate DevSecOps practices from policy-as-code and automated audits to secure secrets management and vulnerability scanning.
- Mentorship & Thought Leadership : Guide and mentor DevOps and SRE engineers. Partner closely with platform developers on infrastructure strategy, deployment patterns, and production readiness.
Ideal Candidate :
Strong Senior / Staff DevOps Engineer Profile
Mandatory (Experience 1) : Must have 9+ years in DevOps / SRE / Infrastructure roles with hands-on experience (clear scale signals like traffic, uptime, latency, infra size should be mentioned)
Mandatory (Experience 2) : Must have worked in Staff / Lead DevOps / SRE / Platform Engineer role OR demonstrated ownership of infra/platform across teams (not just execution role)
Mandatory (Experience 3) : Must have B2B SaaS company experience with multi-tenant architecture OR multiple production stacks (multi-env / multi-client systems)
Mandatory (Tech Skills 1 - Cloud & Infra) : AWS (VPC, EKS, EC2, RDS, networking), Kubernetes (EKS) at scale, Designing high availability, multi-region systems
Mandatory (Tech Skills 2 - Automation & IaC) : Terraform (must-have), Helm / GitOps, Strong scripting (Python / Go / Bash)
Mandatory (Tech Skills 3 - CI/CD & Release) : Scalable CI/CD pipelines (GitHub Actions / Jenkins), Zero/low downtime deployments
Mandatory (Tech Skills 4 - Reliability & Observability) : SRE principles (SLOs, SLIs, error budgets), Monitoring tools (Prometheus, Grafana, Datadog), Alerting, on-call, incident management
Mandatory (Education) : BTech in Computer Science or related fields
Mandatory (Company) : Strong B2B SaaS product companies only
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1629458