HamburgerMenu
hirist

Site Reliability/DevOps Engineer - Cloud Infrastructure

Foundit
Chennai
6 - 10 Years

Posted on: 01/12/2025

Job Description

Job Title : SRE / DevOps Engineer and, Cloud Infrastructure DevOps Engineer

Location & Working Model

Chennai, India

On-site / Hybrid expected to work from office 4 days a week.

Shift timing : 12 : 00 PM to 9 : 00 PM

About Us / Team Context :

We are building a robust, scalable, and cloud-native infrastructure backbone to support our applications and services. We are seeking experienced and motivated DevOps / SRE professionals to join our infrastructure team. This is a critical role : you will be responsible for ensuring reliability, automation, scalability, and efficient deployment workflows, working across cloud environments, containers, orchestration, and infrastructure-as-code.

Role Overview :

As part of the infrastructure team, you will :

- For SRE / DevOps Engineer : ensure system reliability, uptime, scalability, automated deployments, and robust cloud-native operations.

- For Cloud Infrastructure DevOps Engineer : design, build, manage and maintain cloud infrastructure and CI/CD pipelines; manage container orchestration and deploy applications; automate deployments and infrastructure provisioning.

- You will collaborate closely with development, QA, security, and operations teams to deliver production-grade systems, enforce best practices, drive automation, and support deployments and releases.

Key Responsibilities

- Manage and provision cloud infrastructure (on AWS, Azure or GCP) using Infrastructure-as-Code (IaC) tools such as Terraform.

- Manage container orchestration and deployments using Kubernetes; use packaging tools such as Helm charts for application deployments.

- Implement GitOps / CI/CD workflows using tools such as Argo CD (or similar), Git, and version-control to enable automated, reproducible, reliable deployments.

- Write automation and scripting (e.g. in Python, Bash or similar) to support infrastructure tasks, deployment pipelines, configuration management, and operational workflows.

- Setup, manage, and maintain CI/CD pipelines that support continuous integration, delivery and deployment, ensuring fast and reliable releases.

- Ensure high availability, scalability, performance, security, and reliability of production systems and cloud infrastructure. Monitor, troubleshoot, do root-cause analysis (RCA) for incidents, and ensure minimal downtime.

- Collaborate with development, QA, security and operations teams to streamline deployment processes, enforce DevOps / SRE best practices, and support cross-team workflows.

- Maintain documentation, runbooks, best practices, and standards for infrastructure, deployments, and operational procedures.

- Participate in on-call rotations or shift-based operations (given the shift timing), ensuring coverage for production environments, incident response, and post-mortem analyses.

Required Skills & Experience

Total experience : 610 years in DevOps / SRE / Cloud Infrastructure / Platform Engineering roles with production-grade environments.

- Solid hands-on experience with cloud platforms : AWS, Azure or GCP.

- Proficiency in Infrastructure-as-Code (IaC) tools : Terraform (mandatory).

- Strong container orchestration and containerization experience : Kubernetes (production-grade), Helm charts, Docker or analogous technologies.

- Experience with Git / version control, and CI/CD / GitOps workflows including Git, Argo CD (or other GitOps), CI/CD tools (e.g. Jenkins, GitLab CI, GitHub Actions, etc.).

- Scripting / programming skills Python (preferred) or Bash / shell scripting for automation tasks.

- Good understanding of system reliability, monitoring, logging, observability, incident response, and cloud/wrapper security best practices.

- Strong problem-solving, troubleshooting skills, ability to perform root-cause analysis, and ensure production stability under shift-based working model.

- Excellent collaboration and communication skills, ability to work with cross-functional teams, and adapt to dynamic environments.


info-icon

Did you find something suspicious?