Posted on: 14/10/2025
Description:
Key Responsibilities:
Stakeholder & Project Management:
- Manage communication with technical and business stakeholders, ensuring clear alignment of DR objectives.
- Own project timelines, reporting, and governance activities to drive successful DR implementations.
DR Strategy & Implementation:
- Lead the end-to-end design and delivery of Disaster Recovery (DR) strategies on AWS aligned with business continuity and compliance mandates.
- Define and implement RTO (Recovery Time Objective) and RPO (Recovery Point Objective) targets.
Infrastructure Automation:
- Oversee infrastructure provisioning and DR automation using tools such as Terraform, GitLab/GitHub CI/CD, and Ansible.
- Ensure high availability, scalability, and fault tolerance of critical applications and data.
Testing & Readiness:
- Conduct failover testing, disaster recovery drills, and validation to ensure operational preparedness.
- Continuously assess and improve DR procedures and incident response capabilities.
Technical Oversight & Mentorship:
- Provide hands-on support for AWS services including:
VPC, EC2, RDS, Lambda, KMS, IAM, Route 53, and other core services.
- Mentor and guide engineering teams on DR architecture and best practices.
Required Skills & Experience:
- 8+ years of overall IT infrastructure or cloud engineering experience.
- 4+ years of hands-on experience designing and implementing DR solutions on AWS.
- Proficiency with Infrastructure as Code (IaC) tools: Terraform, Ansible.
- Experience with CI/CD pipelines using GitLab, GitHub Actions, or similar.
- Strong understanding of cloud networking, encryption (KMS), identity & access management, and data replication.
- Solid understanding of DR frameworks, compliance standards, and business continuity planning.
- Excellent communication, documentation, and project coordination skills.
Preferred Qualifications:
- AWS Certified Solutions Architect Professional or equivalent.
- Familiarity with multi-region AWS architecture and cross-region replication.
- Experience with regulated environments (e.g., finance, healthcare, etc.).
- Exposure to incident response automation and chaos engineering tools (e.g., AWS Fault Injection Simulator).
Did you find something suspicious?
Posted By
Posted in
CyberSecurity
Functional Area
IT Security
Job Code
1560705
Interview Questions for you
View All