Job Objective :
Lead the architecture, deployment, and operations of large-scale, secure, and resilient cloud infrastructure, ensuring high availability, cost efficiency, and compliance through AWS, Kubernetes/ROSA, and DevOps best practices.
Qualifications & Experience :
- B.Tech/B.E. in CS/IT or related; M.Tech/specialized cloud training a plus.
- 10+ years in infra/cloud/SA roles; 4+ years leading large cloud deployments.
Mandatory : Prior government/public sector project experience
Key Responsibilities :
- Architect and operate AWS environments capable of large-scale usage; design VPC, compute, storage and network for growth and resilience.
- Run ROSA (OpenShift on AWS)/Kubernetes; manage Postgres clusters, Redis caches, search/analytics (Elasticsearch, Cassandra) and secure connectivity.
- Implement HA/DR and backup/restore; execute incident response and capacity planning.
- Lead IaC/DevOps (Terraform/CloudFormation, Ansible, Jenkins) for repeatable, compliant environments.
- Establish observability (Dynatrace, Graylog, Prometheus, Grafana), optimize cost (FinOps), and align with Security on IAM/KMS/WAF.
Essential Skills :
- AWS core: VPC/IAM, EC2/EKS, RDS (Postgres), S3, ALB/NLB.
- Containers & mesh: Kubernetes, ROSA, Istio.
- IaC & CI/CD: Terraform, Ansible, Jenkins, SonarQube.
- Datastores: Postgres, MongoDB, Redis, Elasticsearch, Cassandra; HA/DR.
- Observability: Dynatrace, Graylog, Prometheus, Grafana.
Desirable Skills :
- Certifications : AWS SA-Pro, CKA, OpenShift Admin.
- Govt cloud standards (MeghRaj/GCC).
- FinOps literacy for AWS usage/costs.
- Large-scale performance/load testing experience.