Posted on: 17/12/2025
Experience : 5+ Years
Locations : Chennai | Bangalore
Work Mode : Onsite (Night Shift)
Employment Type : Full-time | Urgent Opening
Job Summary :
We are seeking a highly experienced Senior DevOps Engineer to lead the architecture, implementation, and automation of our multi-cloud infrastructure. In this role, you will be responsible for building resilient, self-healing systems and high-velocity deployment pipelines that support our global operations.
As a senior member of the infrastructure team, you will bridge the gap between development and operations, ensuring that security, scalability, and observability are baked into every layer of our stack. This role operates on a Night Shift to ensure seamless coordination with global time zones and 24/7 system reliability.
Key Responsibilities :
- Infrastructure as Code (IaC) : Design and maintain production-grade infrastructure on AWS and Azure using Terraform modules, ensuring environment parity across Dev, Staging, and Production.
- Orchestration & Containerization : Manage large-scale Kubernetes clusters (EKS/AKS), including the configuration of Ingress controllers, Network Policies, and Auto-scaling groups (HPA/VPA).
- CI/CD Pipeline Engineering : Architect end-to-end automated deployment pipelines using Jenkins, GitLab CI, or Azure DevOps with integrated quality gates and security scanning.
- Configuration Management : Utilize Ansible for automated server provisioning, application deployment, and continuous configuration enforcement across Linux environments.
- Observability & Monitoring : Implement comprehensive monitoring and alerting stacks using Prometheus, Grafana, and ELK (Elasticsearch, Logstash, Kibana) to ensure 99.99% uptime.
- GitOps Implementation : Drive the adoption of GitOps principles for infrastructure and application deployments using tools like ArgoCD or Flux.
- Cloud Security & Compliance : Implement DevSecOps practices, including IAM policy hardening, secret management via HashiCorp Vault, and vulnerability scanning in the build stage.
- Scripting & Tooling : Develop custom automation tools and "glue" scripts in Python, Go, or Bash to eliminate manual operational toil.
- Disaster Recovery : Design and test multi-region failover strategies and automated backup/restore procedures to ensure data integrity and business continuity.
- System Troubleshooting : Act as a final point of escalation for complex production issues, conducting deep-dive root cause analysis (RCA) on infrastructure and networking failures.
Technical Must-Have Skills :
- Cloud Platforms : Deep expertise in AWS (EC2, S3, RDS, Lambda, VPC) and/or Microsoft Azure (VMs, Blob, SQL, VNet).
- Provisioning & Management : Expert-level proficiency in Terraform and Ansible.
- Containers : Mastery of Docker and Kubernetes (Administration, Scheduling, and Security).
- CI/CD Tools : Strong experience with Jenkins, GitLab CI, or GitHub Actions.
- Operating Systems : Advanced Linux Administration skills (Ubuntu, CentOS, or RHEL) including kernel tuning and networking.
- Scripting : Proficiency in Python, Go, or Bash for backend automation and API integrations.
- Networking : Solid understanding of TCP/IP, DNS, Load Balancing (ELB/ALB), and Content Delivery Networks (CDN).
Technical Stack :
- Cloud : AWS, Microsoft Azure
- IaC & Config : Terraform, Ansible, CloudFormation
- Orchestration : Kubernetes (EKS, AKS), Helm, Docker
- CI/CD : Jenkins, GitLab CI, Azure DevOps, ArgoCD
- Monitoring : Prometheus, Grafana, ELK Stack, Datadog
- Version Control : Git, Bitbucket, GitHub
- Security : HashiCorp Vault, Snyk, SonarQube
Preferred Skills & Advanced Qualifications :
- Service Mesh : Experience implementing Istio or Linkerd for microservices traffic management, security, and observability.
- Serverless Architecture : Proven experience managing serverless workloads and event-driven architectures (AWS Lambda/Azure Functions).
- Advanced DevSecOps : Familiarity with OPA (Open Policy Agent) for "Policy as Code" and automated compliance auditing.
- Cost Optimization : Track record of significantly reducing cloud spend through right-sizing, Spot instances, and architectural improvements.
- Certifications : AWS Certified DevOps Engineer Professional, Azure DevOps Engineer Expert, or CKA (Certified Kubernetes Administrator)
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1592034
Interview Questions for you
View All