Posted on: 10/12/2025
Description :
The DevOps Manager will lead the DevOps and Infrastructure initiatives to ensure high availability, scalability, and security of all systems and applications. This role will oversee cloud infrastructure management, CI/CD pipelines, application reliability, automation frameworks, and security hardening. The manager will work closely with development, QA, and SRE teams to maintain operational excellence across all environments.
Key Responsibilities :
1. Infrastructure & Cloud Operations :
- Manage and optimize AWS infrastructure, including EC2, ECS, EFS, RDS (Aurora), ALB/NLB, CloudFront, and WAF.
- Lead cost optimization efforts, including right-sizing, cleanup of unused resources, snapshot management, and tagging compliance.
- Ensure high availability, fault tolerance, and data protection across all services.
- Implement infrastructure-as-code (IaC) using Terraform for consistency and scalability.
2. CI/CD & Automation :
- Oversee Jenkins pipelines for build, test, and deployment automation across multiple environments.
- Maintain a consistent and reliable image versioning and rollout strategy to prevent cross-environment conflicts.
- Integrate and manage SonarQube for static code analysis and quality gates within pipelines.
- Enable blue/green or canary deployment strategies for zero-downtime releases.
3. Security & Compliance :
- Implement best practices for cloud and container security (IAM policies, Secrets Manager, security groups, VPC boundaries).
- Lead vulnerability assessment, remediation, and regular patch management for both OS-level and application-level risks.
- Support SOC 2 and internal compliance audits through evidence collection and control validation.
4. Monitoring, Observability & Incident Management :
- Set up and maintain the observability stack (CloudWatch, ELK), and implement proactive alerting and escalation policies.
- Conduct RCA (Root Cause Analysis) and post-mortem reviews for incidents.
- Drive continuous improvement in incident response, recovery, and reliability.
5. Team Leadership & Governance :
- Lead a team of DevOps and SRE engineers; mentor and upskill junior team members.
- Define and enforce DevOps best practices and operational standards.
- Establish documentation, playbooks, and runbooks for operational consistency.
Preferred Skill Sets :
Technical Skills :
- Cloud : AWS (ECS, EC2, EFS, RDS, SQS, S3, ALB, CloudFront, WAF, IAM, CloudWatch)
- Containerization : Docker, ECS Fargate, ECR
- IaC & Automation : Terraform, CloudFormation
- CI/CD : Jenkins, SonarQube, Artifact management (ECR)
- Monitoring & Logging : ELK Stack, CloudWatch
- Databases : Aurora MySQL, DynamoDB, Amazon ElastiCache for Redis
- Security : AWS KMS, IAM Policies, Vulnerability Management, Network ACLs
- Scripting : Bash, Python (for automation scripts and API-based integrations)
- Networking : VPC design, Load Balancers, DNS (Route 53)
Soft & Leadership Skills :
- Strong communication and collaboration skills across cross-functional teams.
- Experience conducting RCA and leading post-incident reviews.
- Data-driven mindset with a focus on metrics such as uptime, MTTR, and deployment frequency.
- Ability to balance reliability with delivery velocity.
- Mentoring engineers in automation, cloud operations, and security best practices.
Preferred Experience :
- 12+ years of overall experience in DevOps/SRE, with at least 2 years in a managerial or lead role.
- Exposure to microservices-based architectures and API-first systems.
- Experience with compliance frameworks (SOC 2, ISO 27001, etc.) is a plus.
- Certifications : AWS Certified Solutions Architect Associate (required), AWS Certified DevOps Engineer Professional (preferred), AWS Certified Solutions Architect Professional (bonus)
Posted in : DevOps / SRE
Functional Area : DevOps / Cloud
Job Code : 1587744