Posted on: 13/01/2026
Role Overview :
We are looking for a Junior DevOps Engineer to help operate, scale, and improve highly available, customer-facing production systems.
This role is ideal for an engineer with strong hands-on experience in cloud-native infrastructure who enjoys ownership, automation, and working with modern distributed systems.
You will work closely with engineering teams to ensure reliability, performance, and cost efficiency across production environments while continuously improving deployment and operational practices.
Key Responsibilities:
- Operate and maintain high-scale, distributed production platforms with strong focus on uptime, performance, and reliability.
- Manage and support Kubernetes clusters (multi-cluster, multi-tenant) using Helm for application deployments.
- Run and maintain production-grade data systems, including Kafka, ClickHouse, and PostgreSQL.
- Support and enhance CI/CD pipelines using tools such as ArgoCD, GitHub Actions, GitLab CI, or similar.
- Implement and maintain GitOps workflows and Infrastructure-as-Code using Terraform or Pulumi.
- Monitor, troubleshoot, and debug high-traffic production systems, responding to incidents and performance issues.
- Drive automation initiatives to reduce manual effort and improve operational efficiency.
- Participate in cloud cost optimization, capacity planning, and infrastructure scaling activities.
- Collaborate with developers to improve deployment reliability, observability, and system resilience.
- Take ownership of assigned services and infrastructure components as an individual contributor.
Required Skills & Experience:
- 3 - 4 years of experience in DevOps or SRE roles supporting production environments.
- Strong hands-on experience with Kubernetes and Helm, including real-world operational usage.
- Practical experience operating Kafka, ClickHouse, and PostgreSQL in production.
- Solid understanding of cloud infrastructure on AWS and/or GCP (networking, storage, compute).
- Experience with Infrastructure-as-Code tools such as Terraform or Pulumi.
- Hands-on exposure to CI/CD systems (ArgoCD, GitHub Actions, GitLab CI, etc.
- Proven experience with automation, monitoring, and uptime ownership.
- Strong troubleshooting skills for distributed systems and production incidents.
- Ability to work independently, prioritize effectively, and take ownership of outcomes
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1600694