
Job Description

What You'll Do :

Package & Deploy :

- Develop and maintain highly reusable Helm charts, CI/CD pipelines, and deployment templates to ensure consistent delivery of services across multiple customer environments.

- Standardize containerized deployments of our software solutions across Kubernetes distributions such as AKS (Azure Kubernetes Service), EKS (Amazon Elastic Kubernetes Service), GKE (Google Kubernetes Engine), and Red Hat OpenShift.

- Enforce Kubernetes best practices, ensuring that namespaces, RBAC policies, network policies, and resource quotas are configured according to industry security and performance standards.

- Continuously evaluate and improve our deployment processes to reduce time-to-delivery and enhance scalability and reliability.

- Collaborate closely with engineering and product teams to align deployment processes with evolving software architecture.

SaaS Enablement :

- Architect, design, and manage SaaS environments spanning multiple models :

1. Single-Tenant Dedicated Clusters for enterprise customers requiring isolation.

2. Single-Tenant Shared Clusters for efficient scaling across multiple environments.

3. Multi-Tenant Shared Clusters designed for high efficiency and cost-effectiveness.

- Deploy and manage environments across AWS, Azure, and Google Cloud Platform, with a focus on security, compliance, and scalability.

- Implement infrastructure cost optimization strategies by leveraging autoscaling, spot instances, and resource right-sizing, while maintaining high availability.

- Build self-service provisioning capabilities for customer environments, empowering internal teams to spin up resources safely and efficiently.

- Collaborate with security and compliance teams to align SaaS deployments with SOC2, ISO 27001, and CIS benchmarks.

Developer Environments & Automation :

- Automate the provisioning and lifecycle management of Development, QA, and UAT environments, ensuring consistency and reliability across stages.

- Integrate automation with source control and CI/CD systems to enable on-demand environment creation for feature branches or testing pipelines.

- Develop tools and frameworks to enhance developer productivity, reduce manual configuration, and improve feedback loops between DevOps and engineering.

- Implement policy-as-code and compliance-as-code principles to standardize infrastructure governance.

- Partner with developers to identify and resolve build, deployment, and runtime issues efficiently.

Technical Implementation & Leadership :

- Take ownership of hands-on implementation of DevOps pipelines, Kubernetes infrastructure, and environment automation.

- Collaborate with architecture teams to design resilient, observable, and secure infrastructure solutions.

- (Optional) Mentor and lead a team of 5 to 10 DevOps engineers, fostering a culture of collaboration, technical excellence, and continuous learning.

- Conduct design and code reviews for automation scripts, CI/CD pipelines, and infrastructure-as-code implementations.

- Participate in on-call rotations or establish systems that ensure smooth operational response to incidents.

- Contribute to long-term platform vision and roadmap development.

Monitoring & Observability :

- Implement end-to-end observability across applications and infrastructure using OpenTelemetry (OTLP), Prometheus, Grafana, Loki, and Jaeger.

- Build dashboards, alerts, and tracing systems to ensure proactive detection of anomalies, performance bottlenecks, and resource inefficiencies.

- Integrate monitoring, logging, and tracing into CI/CD workflows to enable continuous feedback and system health visibility.

- Establish SLOs, SLIs, and SLAs to measure and maintain reliability objectives across services.

- Collaborate with engineering teams to define meaningful metrics and enable data-driven operational decisions.

What Makes You a Great Fit :

- Expert-level Kubernetes and Helm experience, with a deep understanding of workload orchestration, service mesh configuration, ingress management, and multi-cluster operations.

- Proven ability to design and maintain complex CI/CD pipelines using Jenkins, GitHub Actions, or GitLab CI.

- Hands-on experience deploying and operating workloads in AWS, Azure, GCP, or OpenShift environments.

- Skilled in scripting with Shell, Python, or Groovy for automation, integration, and system tooling.

- Practical experience with Infrastructure as Code (IaC) using Terraform or CloudFormation, and configuration management via Ansible or Puppet.

- Deep understanding of cloud-native security and compliance :

1. Container image hardening and vulnerability scanning.

2. Role-Based Access Control (RBAC) and fine-grained permissions.

3. Secrets management using Vault, SSM, or Kubernetes secrets.

- Experience implementing DevOps workflows aligned with OPA policies, CIS Benchmarks, and SOC2 requirements.

- Demonstrated experience with observability stacks (OpenTelemetry, Prometheus, Grafana, Loki, Jaeger) for metrics, logs, and traces collection and visualization.

- Strong problem-solving and analytical skills with an ability to diagnose complex distributed system issues.

- Excellent communication and collaboration skills, with the ability to work cross-functionally with developers, architects, and security teams.

Bonus Points :

- Certifications in major cloud platforms (AWS Certified DevOps Engineer, Azure DevOps Expert, or GCP Professional Cloud DevOps Engineer).

- Experience implementing GitOps workflows using Argo CD or Flux.

- Exposure to lightweight or virtual Kubernetes cluster tools like kind, vCluster, or K3s for local testing and CI integration.

- Understanding of secure multi-tenant architectures and network isolation strategies.

- Development experience in Java or Python, with an understanding of microservice architectures.

- Proven track record of cloud cost optimization, leveraging automation and observability to reduce operational expenditure without compromising performance.

- Experience building internal developer platforms (IDPs) or platform-as-a-service solutions for engineering teams.

