HamburgerMenu
hirist

Job Description

Job Purpose :


We are modernizing our Cloud Operations practice moving away from manual, console-driven work toward a fully automated, infrastructure-as-code culture. This role is the catalyst for that transformation.


You will join a team based in Bangalore and serve as the technical anchor for our SRE/DevOps evolution. You are the kind of engineer who sees a manual process and immediately thinks about how to eliminate it. You are comfortable in both Windows and Linux environments, fluent in AWS, and passionate about treating infrastructure the same way developers treat application code versioned, tested, and deployed via pipelines.


Key Responsibilities :


Infrastructure automation & IaC :


- Lead the migration from ClickOps to fully automated, code-driven infrastructure provisioning using Terraform


- Build and maintain reusable Terraform modules for AWS resources (VPCs, EC2, EKS, RDS, Aurora, IAM, and more)


- Develop and manage Ansible playbooks for configuration management across Windows and Linux fleets


- Champion the principle that every infrastructure change flows through code and version control no exceptions


GitOps and CI/CD :


- Design and implement GitOps workflows using GitHub actions and/or ArgoCD for infrastructure and application deployments


- Establish branch protection, code review gates, and automated validation pipelines for infrastructure PRs


- Define and enforce GitOps standards across the team, onboarding colleagues to the new way of working


Cloud Operations & SRE :


- Own reliability, availability and scalability of cloud infrastructure across AWS environments


- Define and track SLOs/SLIs; build alerting, dashboards and runbooks to support operational excellence


- Respond to and lead post-mortems on production incidents, drive systemic fixes to eliminate recurrence


- Implement cost optimization strategies and enforce tagging/governance policies


Enablement and Knowledge Transfer :


- Mentor Cloud Operations team members, building internal capacity in IaC and automation practices


- Document patterns, standards and architectural decisions; make knowledge accessible to the broader team


- Act as a force multiplier your impact is measured not just by what you build, but by what the team is able to build after you


Skills & Experience :


Core Infrastructure :


- Strong hands-on experience with both Linux (Amazon Linux, RHEL) and Windows Server administration


- Deep AWS proficiency : compute, networking, storage, security and managed services


- Experience managing hybrid or mixed OS environments at scale


Automation and IaC :


- Terraform : module development, state management, remote backends, workspace strategies


- Ansible : playbook authoring, roles, inventories, vault integration, Windows WinRM support, Amazon SSM


- Strong scripting skills in Bash and Powershell, Python is a strong plus.


GitOps and Version Control :


- GitHub : advanced branching, Actions workflows, protected branches, secrets management


- Practical experience implementing GitOps patterns (pull-based deployments, declarative configuration, drift detection)


- Familiarity with tools such as ArgoCD, Flux, or equivalent


Reliability Engineering :


- Experience defining SLOs, building dashboards and managing alerts (CloudWatch, Prometheus/VictoriaMetrics, or similar)


- Comfortable owning on-call rotations and driving incident response and post-mortems

info-icon

Did you find something suspicious?

Similar jobs that you might be interested in