Posted on: 24/09/2025
Job Description :
Responsibilities :
- Design, deploy, and manage Kubernetes clusters (EKS, self-managed, and hybrid).
- Ensure high availability, scalability, and resilience of containerized platforms.
- Implement and manage networking (CNI plugins, ingress controllers, service mesh) for cluster connectivity.
- Manage container security, image scanning, and policy enforcement ( Kyverno).
- Automate cluster provisioning, upgrades, and patching using tools like Terraform, Ansible, or Helm.
- Integrate observability tools (Prometheus, Grafana, Datadog) for monitoring and alerting.
- Troubleshoot node, pod, networking, and storage issues to ensure reliable application performance.
- Optimize resource usage (requests/limits, HPA, VPA) and cluster cost efficiency.
- Support CI/CD integration with Kubernetes (GitOps tools like ArgoCD)
- Collaborate with security, cloud foundation, and developer teams to provide a robust container platform.
Required Skills :
- Deep knowledge of Kubernetes internals (control plane, scheduling, networking, storage).
- Experience with containerization (Docker) and container runtime interfaces .
- Strong skills in Infrastructure as Code (Terraform, Helm, Ansible).
- Proficiency with Linux system administration and troubleshooting.
- Hands-on with CNCF ecosystem tools (ArgoCD, Prometheus, Grafana).
- Familiarity with cloud-managed Kubernetes services (EKS)
- Good understanding of RBAC, IAM, TLS, and Kubernetes security hardening practices.
- Knowledge of logging and monitoring stacks (EFK/ELK, Loki, Dynatrace, Datadog).
- Strong troubleshooting skills (pods crashloop, node not ready, networking bottlenecks, storage issues).
- Experience in scaling multi-cluster and multi-tenant Kubernetes environments.
Certifications : Certified Kubernetes Administrator (CKA) ( Must Have )
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1551273
Interview Questions for you
View All