
SolarWinds - Senior Manager - Site Reliability Engineering

Solarwinds India Pvt Ltd
Bangalore
10 - 15 Years
Rating: 3.8 (22+ Reviews)

Posted on: 17/11/2025

Job Description

Description :


At SolarWinds, we're a people-first company.

Our purpose is to enrich the lives of the people we serve, including our employees, customers, shareholders, partners, and communities.

Join us in our mission to help customers accelerate business transformation with simple, powerful, and secure solutions.

The ideal candidate thrives in an innovative, fast-paced environment and is collaborative, accountable, ready, and empathetic.

We're looking for individuals who believe they can accomplish more as a team and create lasting growth for themselves and others.

We hire based on attitude, competency, and commitment.

Solarians are ready to advance our world-class solutions in a fast-paced environment and accept the challenge to lead with purpose.

If you're looking to build your career with an exceptional team, you've come to the right place.

Join SolarWinds and grow with us!

About the Role :


We're looking for a Senior Manager, Platform Engineering to lead two core pillars for the SolarWinds Observability Platform: Developer Experience (DevEx) and Site Reliability Engineering (SRE).

You'll guide a team that builds internal developer platforms, improves CI/CD and testing workflows, and ensures our cloud-native services remain reliable, scalable, and performant.

This is a hands-on leadership role with strong ownership, high visibility, and impact across global engineering teams.

What You'll Do :


Platform Engineering & DevEx :


- Lead a team focused on improving developer workflows, CI/CD pipelines, and test automation.

- Build tools, templates, and automation that make development easier, faster, and more consistent.

- Drive adoption of best practices that improve engineering productivity and reduce friction.

- Track and improve DevEx metrics such as deployment frequency, build times, and MTTR.

Site Reliability Engineering :


- Lead SREs responsible for uptime, scalability, performance, and operational excellence.

- Implement SRE fundamentals: SLOs, proactive alerting, incident response, and capacity and cost management.

- Strengthen observability, monitoring, and deployment reliability across Kubernetes-based services.

- Guide architectural decisions across AWS, Azure, Kubernetes, Terraform, and GitOps workflows.

- Partner with engineering, product, and infrastructure teams to align platform initiatives with business needs.

- Promote strong engineering standards, documentation, and technical ownership.

Must Have :


- 10+ years in SRE, Platform Engineering, DevOps, or Cloud Infrastructure.

- 5+ years managing engineering teams.

- Strong hands-on experience with Kubernetes, AWS/Azure, CI/CD, Terraform, and GitOps (ArgoCD/Flux).

- Solid understanding of SRE practices (SLOs, incident management, capacity, cost).

- Experience improving developer experience, workflows, or internal platforms.

- Excellent communication and ability to influence across global teams.

- Experience with internal developer portals (e.g., Backstage).

- Background in distributed systems or SaaS platforms.

- Programming experience in Go, Java, or Python.

