Posted on: 23/10/2025
Description :
We are looking for a Sr. Cloud Specialist with deep expertise in cloud agnostic services, DevOps
practices, and infrastructure automation to lead our cloud transformation initiatives. This role is
critical in building and scaling our cloud-native travel platform infrastructure for ensuring high availability, security, and operational excellence.
Key Responsibilities :
Cloud Architecture & Design : Design, implement, and manage scalable, secure, and cost-optimized AWS cloud infrastructure supporting microservices architecture and serverless applications.
Infrastructure as Code (IaC) : Develop and maintain infrastructure automation using Terraform, Ansible, AWS CloudFormation, ymal, and CDK. Implement version-controlled infrastructure templates following best practices.
CI/CD Pipeline Management : Build, optimize, and maintain robust CI/CD pipelines using Azure Pipelines, GitHub Actions, and for automated build, test, and deployment workflows.
Container Orchestration : Manage containerized workloads using Docker, Amazon ECS, EKS (Kubernetes), Lambda, and implement service mesh patterns for microservices communication.
Serverless Architecture : Design and deploy serverless solutions using AWS Lambda, API Gateway, Step Functions, SQS, SNS, and EventBridge, Apache Pulser, HashiCorp Vault, Consul for event-driven architectures.
Monitoring & Observability : Implement comprehensive monitoring and alerting using OpenTelemetry, Prometheus, Grafana, AWS CloudWatch, and Elasticsearch/ELK Stack. Establish SLIs, SLOs, and SLAs.
Security & Compliance : Implement security best practices including IAM policies, security groups, VPC design, encryption (KMS), secrets management (AWS Secrets Manager), and compliance frameworks (SOC2, PCI-DSS).
Database Management : Manage and optimize RDS (PostgreSQL), DynamoDB, MongoDB, Redis/ ElastiCache, and QLDB for various application needs.
Performance Optimization : Conduct performance tuning, cost optimization, and capacity planning. Implement auto-scaling, load balancing, and caching strategies.
Disaster Recovery & Backup : Design and implement backup strategies, disaster recovery plans, and business continuity solutions with defined RTOs and RPOs.
Collaboration & Mentorship : Work closely with engineering, product, and QA teams to enable DevOps culture. Mentor junior DevOps engineers and conduct knowledge-sharing sessions.
- Incident Management : Lead incident response, root cause analysis, and implement
post-mortem learnings to improve system reliability.
Required Qualifications :
- Education : Bachelor's degree in Computer Science, Engineering, or equivalent technical discipline.
- Experience : 6-10 years of total experience, with at least 5 years in AWS cloud and DevOps roles.
AWS Expertise :
- Deep hands-on experience with AWS core services : EC2, S3, VPC, IAM, CloudFront, Route53, ALB/NLB
- Strong knowledge of AWS Lambda, API Gateway, Step Functions, SQS, SNS, EventBridge
- Experience with ECS, EKS (Kubernetes), ECR, and container orchestration
- Proficiency in RDS (PostgreSQL), DynamoDB, DocumentDB, ElastiCache (Redis), QLDB
- Understanding of AWS networking, security groups, NACLs, VPC peering, Transit Gateway
- Experience with CloudWatch, X-Ray, CloudTrail, Config, and Systems Manager
DevOps & Automation :
- Strong expertise in Infrastructure as Code : Terraform (required), CloudFormation, or AWS CDK
- Hands-on experience with CI/CD tools : Azure Pipelines, GitHub Actions, Jenkins, or GitLab CI
- Proficiency in scripting : Python, Bash, PowerShell
- Experience with configuration management tools (Ansible preferred)
Containerization & Orchestration :
- Deep knowledge of Docker, container optimization, multi-cloud containers, and multi-stage builds
- Hands-on experience with Kubernetes (EKS) or Amazon ECS/Fargate
- Understanding of Helm charts, service mesh (Istio/App Mesh), and ingress controllers
Monitoring & Observability :
- Experience implementing OpenTelemetry, Prometheus, Grafana, and Elasticsearch/ELK Stack
- Strong knowledge of distributed tracing, metrics collection, and log aggregation
- Ability to create meaningful dashboards, alerts, and SLO-based monitoring
Security & Compliance :
- Strong understanding of AWS security best practices and compliance frameworks
- Experience with IAM policies, RBAC, secrets management, and encryption strategies
- Knowledge of security scanning tools (SonarCloud, Trivy, AWS Inspector)
Methodologies & Tools :
- Strong command of Git, Git workflows, and branching strategies
- Experience with Agile/Scrum methodologies
- Familiarity with cost optimization tools (AWS Cost Explorer, Trusted Advisor)
Soft Skills :
- Excellent troubleshooting and problem-solving abilities
- Strong communication and documentation skills
- Ability to work in fast-paced environments and manage multiple priorities
- Proactive approach to identifying and resolving issues
Preferred Skills :
- AWS Certifications : AWS Certified Solutions Architect (Professional), AWS Certified
DevOps Engineer, or AWS Certified Security Specialty.
- Experience with multi-cloud or hybrid cloud environments (Azure familiarity is a plus)
- Knowledge of GitOps practices (ArgoCD, FluxCD)
- Experience with chaos engineering and resilience testing
- Familiarity with FinOps practices and cloud cost optimization
- Background in travel, e-commerce, or high-traffic consumer platforms
- Experience with AWS Lambda performance optimization and cold start reduction
- Knowledge of API Gateway patterns, throttling, and usage plan
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1563312
Interview Questions for you
View All