Posted on: 20/02/2026
Description :
1. AWS (Amazon Web Services) :
- Strong hands-on expertise with core services : EC2, S3, IAM, RDS, Lambda, VPC, CloudWatch, CloudTrail, and Kafka (e.g., Amazon MSK).
- Experience designing, implementing, and managing scalable, fault-tolerant cloud infrastructure.
- Knowledge of security best practices : IAM roles/policies, key management, encryption at rest/in transit.
- Exposure to cost optimization, monitoring, and centralized logging using AWS-native tools.
- Familiarity with multi-account strategy (Organizations, SCPs) and advanced networking (Transit Gateway, VPC peering).
2. Terraform (Infrastructure as Code - IaC) :
- Proficient in writing modular, reusable Terraform configurations with workspaces, modules, and remote backends.
- Hands-on with state management (Terraform Cloud / S3 + DynamoDB for state locking).
- Experience integrating Terraform with CI/CD pipelines for automated provisioning.
- Understanding of infrastructure versioning, drift detection, and policy enforcement (Sentinel / OPA).
3. ArgoCD (GitOps) :
- Strong understanding of GitOps workflows and ArgoCD as a Kubernetes deployment controller.
- Experience deploying and managing ArgoCD in production-grade clusters.
- Hands-on with ApplicationSets, Sync Policies, and RBAC in ArgoCD.
- Skilled in debugging sync/health issues and enforcing secure GitOps practices.
4. Kubernetes + Helm / Kustomize :
- Advanced experience managing Kubernetes clusters (EKS preferred).
- Proficient in Helm chart development, customization, and templating.
- Experience with Kustomize overlays for environment-specific configuration.
- Familiar with secret management using Sealed Secrets / SOPS.
- Ability to evaluate trade-offs between Helm and Kustomize for application delivery.
5. CI/CD - Jenkins :
- Strong scripting skills : Linux, shell, Python.
- Hands-on with Jenkins (Freestyle & Pipeline) and Groovy scripting.
- Experience managing Jenkins controller-agent (formerly master-agent) architecture, scaling, and plugin optimization.
- Skilled in designing robust CI/CD pipelines for cloud-native and containerized workloads.
6. Security (Cloud & Container Security) :
- Experience leading end-to-end security posture improvements across AWS and Kubernetes environments.
- Hands-on with VAPT lifecycle : coordinating assessments, analyzing vulnerabilities, driving remediation.
- Implementation of cloud security best practices : IAM least privilege, WAF, GuardDuty, Security Hub, audit logging.
- Kubernetes security : network policies, Pod Security Standards, admission controllers (OPA/Gatekeeper), CIS Benchmark compliance.
- Container security : Docker image scanning using Trivy, Clair, Aqua, or equivalent; remediation of CVEs.
7. Observability & Monitoring (Platform Reliability) :
- Strong hands-on experience with Prometheus for metrics collection, alerting, and custom exporters across Kubernetes and AWS environments.
- Advanced dashboarding and visualization using Grafana (infra, application, and SLO/SLI dashboards).
- Centralized logging implementation using Loki and ELK stack (Elasticsearch, Logstash, Kibana).
- Distributed tracing implementation using OpenTelemetry integrated with Jaeger/Tempo or equivalent backends.
- Experience defining Golden Signals, SLOs/SLAs, alert optimization, and proactive reliability engineering practices.
8. Databases (MongoDB & PostgreSQL) :
- Hands-on experience managing MongoDB and PostgreSQL in production environments.
- Familiar with cluster setup, backup/restore strategies, monitoring, and scaling.
- Knowledge of performance tuning, query optimization, and high availability (replication/sharding for MongoDB, replication for PostgreSQL).
- Experience with cost optimization for managed services (e.g., Amazon RDS, Aurora, or Atlas).
- Experience implementing security best practices : encryption, role-based access, auditing, and vulnerability scanning.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1614500