HamburgerMenu
hirist

Grootan Technologies - Site Reliability Engineer - Cloud Infrastructure

Grootan Technologies
Chennai
4 - 5 Years
star-icon
4.3white-divider34+ Reviews

Posted on: 11/12/2025

Job Description

About the Role :


We are looking for an experienced Site Reliability Engineer (SRE) with 45 years of hands-on experience to join our engineering team. In this role, you will be responsible for designing, building, and maintaining reliable, scalable, and secure infrastructure to support our applications.

You will leverage your expertise in automation, cloud technologies, and monitoring systems to ensure high availability, optimal performance, and overall platform resilience.

Key Responsibilities :

- Build and manage scalable, reliable, and secure cloud infrastructure.

- Develop automation tools to streamline deployment, monitoring, and operations.

- Monitor system performance, troubleshoot issues, and conduct root cause analysis.

- Implement and maintain CI/CD pipelines to support rapid and safe releases.

- Ensure adherence to SRE best practices, including SLIs, SLOs, and error budgets.

- Optimize infrastructure costs while maintaining high system reliability.

- Collaborate closely with development teams to enhance application reliability and performance.

- Maintain comprehensive documentation for systems, processes, and incident responses.

Required Skills & Experience :

- 4 - 5 years of experience as an SRE, DevOps Engineer, or similar role.

- Strong experience with cloud platforms such as AWS, Azure, or GCP.

- Proficiency in automation and scripting using Python, Bash, Go, or similar languages.

- Hands-on experience with containerization and orchestration tools (Docker, Kubernetes).

- Strong understanding of CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.

- Expertise in monitoring and logging tools (Prometheus, Grafana, ELK, Datadog, etc.

- Solid grasp of networking concepts, security best practices, and infrastructure-as-code (Terraform, CloudFormation).

- Strong analytical and problem-solving skills with a focus on system reliability and uptime


info-icon

Did you find something suspicious?