HamburgerMenu
hirist

Job Description

Description :


We are looking for a Sr. Cloud Specialist with deep expertise in cloud agnostic services, DevOps
practices, and infrastructure automation to lead our cloud transformation initiatives. This role is
critical in building and scaling our cloud-native travel platform infrastructure for ensuring high availability, security, and operational excellence.



Key Responsibilities :

Cloud Architecture & Design : Design, implement, and manage scalable, secure, and cost-optimized AWS cloud infrastructure supporting microservices architecture and serverless applications.



Infrastructure as Code (IaC) : Develop and maintain infrastructure automation using Terraform, Ansible, AWS CloudFormation, ymal, and CDK. Implement version-controlled infrastructure templates following best practices.



CI/CD Pipeline Management : Build, optimize, and maintain robust CI/CD pipelines using Azure Pipelines, GitHub Actions, and for automated build, test, and deployment workflows.



Container Orchestration : Manage containerized workloads using Docker, Amazon ECS, EKS (Kubernetes), Lambda, and implement service mesh patterns for microservices communication.



Serverless Architecture : Design and deploy serverless solutions using AWS Lambda, API Gateway, Step Functions, SQS, SNS, and EventBridge, Apache Pulser, HashiCorp Vault, Consul for event-driven architectures.



Monitoring & Observability : Implement comprehensive monitoring and alerting using OpenTelemetry, Prometheus, Grafana, AWS CloudWatch, and Elasticsearch/ELK Stack. Establish SLIs, SLOs, and SLAs.



Security & Compliance : Implement security best practices including IAM policies, security groups, VPC design, encryption (KMS), secrets management (AWS Secrets Manager), and compliance frameworks (SOC2, PCI-DSS).



Database Management : Manage and optimize RDS (PostgreSQL), DynamoDB, MongoDB, Redis/ ElastiCache, and QLDB for various application needs.



Performance Optimization : Conduct performance tuning, cost optimization, and capacity planning. Implement auto-scaling, load balancing, and caching strategies.



Disaster Recovery & Backup : Design and implement backup strategies, disaster recovery plans, and business continuity solutions with defined RTOs and RPOs.



Collaboration & Mentorship : Work closely with engineering, product, and QA teams to enable DevOps culture. Mentor junior DevOps engineers and conduct knowledge-sharing sessions.



- Incident Management : Lead incident response, root cause analysis, and implement
post-mortem learnings to improve system reliability.



Required Qualifications :

- Education : Bachelor's degree in Computer Science, Engineering, or equivalent technical discipline.



- Experience : 6-10 years of total experience, with at least 5 years in AWS cloud and DevOps roles.



AWS Expertise :

- Deep hands-on experience with AWS core services : EC2, S3, VPC, IAM, CloudFront, Route53, ALB/NLB



- Strong knowledge of AWS Lambda, API Gateway, Step Functions, SQS, SNS, EventBridge



- Experience with ECS, EKS (Kubernetes), ECR, and container orchestration



- Proficiency in RDS (PostgreSQL), DynamoDB, DocumentDB, ElastiCache (Redis), QLDB



- Understanding of AWS networking, security groups, NACLs, VPC peering, Transit Gateway



- Experience with CloudWatch, X-Ray, CloudTrail, Config, and Systems Manager



DevOps & Automation :

- Strong expertise in Infrastructure as Code : Terraform (required), CloudFormation, or AWS CDK



- Hands-on experience with CI/CD tools : Azure Pipelines, GitHub Actions, Jenkins, or GitLab CI



- Proficiency in scripting : Python, Bash, PowerShell



- Experience with configuration management tools (Ansible preferred)



Containerization & Orchestration :

- Deep knowledge of Docker, container optimization, multi-cloud containers, and multi-stage builds



- Hands-on experience with Kubernetes (EKS) or Amazon ECS/Fargate



- Understanding of Helm charts, service mesh (Istio/App Mesh), and ingress controllers



Monitoring & Observability :

- Experience implementing OpenTelemetry, Prometheus, Grafana, and Elasticsearch/ELK Stack



- Strong knowledge of distributed tracing, metrics collection, and log aggregation



- Ability to create meaningful dashboards, alerts, and SLO-based monitoring



Security & Compliance :

- Strong understanding of AWS security best practices and compliance frameworks



- Experience with IAM policies, RBAC, secrets management, and encryption strategies



- Knowledge of security scanning tools (SonarCloud, Trivy, AWS Inspector)



Methodologies & Tools :

- Strong command of Git, Git workflows, and branching strategies



- Experience with Agile/Scrum methodologies



- Familiarity with cost optimization tools (AWS Cost Explorer, Trusted Advisor)



Soft Skills :

- Excellent troubleshooting and problem-solving abilities



- Strong communication and documentation skills



- Ability to work in fast-paced environments and manage multiple priorities



- Proactive approach to identifying and resolving issues



Preferred Skills :

- AWS Certifications : AWS Certified Solutions Architect (Professional), AWS Certified
DevOps Engineer, or AWS Certified Security Specialty.



- Experience with multi-cloud or hybrid cloud environments (Azure familiarity is a plus)



- Knowledge of GitOps practices (ArgoCD, FluxCD)



- Experience with chaos engineering and resilience testing



- Familiarity with FinOps practices and cloud cost optimization



- Background in travel, e-commerce, or high-traffic consumer platforms



- Experience with AWS Lambda performance optimization and cold start reduction



- Knowledge of API Gateway patterns, throttling, and usage plan


info-icon

Did you find something suspicious?