HamburgerMenu
hirist

Site Reliability Engineer

Forward Eye Technologies
6 - 16 Years
Anywhere in India/Multiple Locations

Posted on: 10/03/2026

Job Description

Job Title : Site Reliability Engineer (SRE)


Job Summary :


We are seeking a skilled Site Reliability Engineer (SRE) to manage and improve system reliability, scalability, and performance. The ideal candidate will have strong experience in cloud platforms, container orchestration, CI/CD pipelines, and monitoring tools. This role focuses on automation, observability, and maintaining highly available production environments.


Key Responsibilities :


- Manage and maintain Kubernetes-based infrastructure for container orchestration.


- Deploy and manage applications using Helm packaging.


- Build and maintain CI/CD pipelines using GitLab CI/CD, Azure DevOps, and Jenkins.


- Automate infrastructure provisioning using Terraform.


- Implement monitoring and observability using tools like Dynatrace, Datadog, and Grafana.


- Manage log aggregation and analysis using the EFK stack (Elasticsearch, Fluentd, Kibana).


- Ensure system reliability, availability, and performance across Azure and AWS cloud platforms.


- Maintain and optimize database systems such as PostgreSQL and IBM DB2.


- Manage source code repositories using Git.


- Develop automation scripts using Bash.


- Troubleshoot production issues and improve system resilience.


Required Skills :


- Strong experience with Kubernetes and container orchestration.


- Hands-on experience with Helm.


- Expertise in CI/CD tools such as GitLab CI/CD, Azure DevOps, and Jenkins.


- Experience with Infrastructure as Code (Terraform).


- Knowledge of monitoring and observability tools (Dynatrace, Datadog, Grafana).


- Experience with log analytics using EFK stack.


- Experience working with cloud platforms such as Azure and AWS.


- Familiarity with databases like PostgreSQL and IBM DB2.


- Strong scripting skills in Bash.


- Proficiency in Git for version control.


Preferred Skills :


- Experience in DevOps and SRE practices.


- Knowledge of system reliability, scalability, and high availability architecture.


- Strong troubleshooting and problem-solving abilities.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in