HamburgerMenu
hirist

Job Description

Role Overview :

We are looking for an Infra (DevOps + SRE) Manager to lead and scale our cloud infrastructure and reliability engineering function.

This role is responsible for owning platform reliability, availability, scalability, security, and cost efficiency, while building and mentoring a high-performing DevOps/SRE team.

You will work closely with Engineering, Product, and Security leaders to ensure our B2B SaaS platform operates reliably at scale and supports rapid product innovation.

This is a hands-on leadership role you are expected to guide architecture and critical technical decisions, while enabling your team to execute.

Key Responsibilities :

Team & Leadership :

- Build, lead, and mentor a team of DevOps and SRE engineers.

- Drive hiring, onboarding, goal-setting, performance reviews, and career development.

- Foster a strong culture of ownership, automation, reliability, and continuous improvement.

- Act as a multiplier unblock teams, set direction, and raise the overall engineering bar.

Infrastructure & Platform Ownership :

- Own the cloud infrastructure strategy across AWS (primary), including scalability, resiliency, security, and cost optimization.

- Define and evolve Infrastructure as Code (IaC) standards using Terraform.

- Establish best practices for CI/CD, container orchestration (EKS/ECS), and cloud-native architectures.

- Partner with engineering teams to design infrastructure that supports rapid, safe deployments.

SRE & Reliability :

- Define and drive SRE practices, including SLIs, SLOs, error budgets, and reliability reviews.

- Lead incident management, serving as an incident commander for critical outages.

- Drive blameless postmortems, ensuring actionable follow-ups and systemic improvements.

- Ensure strong observability across systems using monitoring, logging, and alerting platforms.

Security, Compliance & Governance :

- Collaborate with Security teams to embed DevSecOps practices into pipelines and infrastructure.

- Ensure compliance with standards such as SOC 2, ISO 27001, and internal security policies.

- Oversee secrets management, access control, encryption, and vulnerability management.

- Own infrastructure cost visibility, optimization, and budget accountability.

Cross-functional Collaboration :

- Partner closely with Engineering Managers, Architects, and Product leaders on roadmap planning.

- Act as a trusted advisor on infrastructure, scalability, and operational readiness.

- Communicate clearly with global stakeholders across time zones.

Required Skills & Experience :

- Bachelors degree in Computer Science or a related field.

- 8+ years of experience in DevOps / SRE / Infrastructure roles.

- 2+ years of people management experience, leading DevOps or SRE teams.

- Strong expertise in AWS, including EKS/ECS, ECR, S3, RDS, KMS, MSK, and Elasticsearch.

- Deep experience with Terraform and infrastructure automation.

- Solid understanding of CI/CD systems (Jenkins, AWS CodePipeline, GitHub Actions).

- Strong knowledge of Kubernetes, containerized workloads, and microservices.

- Experience operating production SaaS platforms at scale.

- Excellent communication skills and the ability to influence across teams.

Nice to Have :

- Experience building or scaling SRE practices in a fast-growing startup.

- Strong understanding of cloud cost optimization and FinOps.

- Experience with tools like Datadog, New Relic, PagerDuty, ELK, Grafana.

- Knowledge of identity and access management (AWS Cognito, Keycloak).

- AWS or Kubernetes certifications (DevOps Pro, CKA, etc.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in