Job Description

Description :

The DevOps Manager will lead the DevOps and Infrastructure initiatives to ensure high availability, scalability, and security of all systems and applications. This role will oversee cloud infrastructure management, CI/CD pipelines, application reliability, automation frameworks, and security hardening. The manager will work closely with development, QA, and SRE teams to maintain operational excellence across all environments.

Key Responsibilities :

1. Infrastructure & Cloud Operations :

- Manage and optimize AWS infrastructure, including EC2, ECS, EFS, RDS (Aurora), ALB/NLB, CloudFront, and WAF.

- Lead cost optimization efforts: right-sizing, cleaning up unused resources, snapshot management, and tagging compliance.

- Ensure high availability, fault tolerance, and data protection across all services.

- Implement infrastructure-as-code (IaC) using Terraform for consistency and scalability.

2. CI/CD & Automation :

- Oversee Jenkins pipelines for build, test, and deployment automation across multiple environments.

- Maintain a consistent and reliable image versioning and rollout strategy to prevent cross-environment conflicts.

- Integrate and manage SonarQube for static code analysis and quality gates within pipelines.

- Enable blue/green or canary deployment strategies for zero-downtime releases.

3. Security & Compliance :

- Implement best practices for cloud and container security (IAM policies, Secrets Manager, security groups, VPC boundaries).

- Lead vulnerability assessment, remediation, and regular patch management for both OS-level and application-level risks.

- Support SOC 2 and internal compliance audits through evidence collection and control validation.

4. Monitoring, Observability & Incident Management :

- Set up and maintain the observability stack (CloudWatch, ELK), and implement proactive alerting and escalation policies.

- Conduct RCA (Root Cause Analysis) and post-mortem reviews for incidents.

- Drive continuous improvement in incident response, recovery, and reliability.

5. Team Leadership & Governance :

- Lead a team of DevOps and SRE engineers; mentor and upskill junior team members.

- Define and enforce DevOps best practices and operational standards.

- Establish documentation, playbooks, and runbooks for operational consistency.

Preferred Skill Sets :

Technical Skills :

- Cloud : AWS (ECS, EC2, EFS, RDS, SQS, S3, ALB, CloudFront, WAF, IAM, CloudWatch)

- Containerization : Docker, ECS Fargate, ECR

- IaC & Automation : Terraform, CloudFormation

- CI/CD : Jenkins, SonarQube, Artifact management (ECR)

- Monitoring & Logging : ELK Stack, CloudWatch

- Databases : Aurora MySQL, DynamoDB, Amazon ElastiCache for Redis

- Security : AWS KMS, IAM Policies, Vulnerability Management, Network ACLs

- Scripting : Bash, Python (for automation scripts and API-based integrations)

- Networking : VPC design, Load Balancers, DNS (Route 53)

Soft & Leadership Skills :

- Strong communication and collaboration skills across cross-functional teams.

- Experience conducting RCA and leading post-incident reviews.

- Data-driven mindset with a focus on metrics such as uptime, MTTR, and deployment frequency.

- Ability to balance reliability with delivery velocity.

- Mentoring engineers in automation, cloud operations, and security best practices.

Preferred Experience :

- 12+ years of overall experience in DevOps/SRE, with at least 2 years in a managerial or lead role.

- Exposure to microservices-based architectures and API-first systems.

- Experience with compliance frameworks (SOC 2, ISO 27001, etc.) is a plus.

- Certifications : AWS Certified Solutions Architect - Associate (required), AWS Certified DevOps Engineer - Professional (preferred), AWS Certified Solutions Architect - Professional (bonus)

