Posted on: 10/03/2026
Job Title : Site Reliability Engineer (SRE)
Job Summary :
We are seeking a skilled Site Reliability Engineer (SRE) to manage and improve system reliability, scalability, and performance. The ideal candidate will have strong experience in cloud platforms, container orchestration, CI/CD pipelines, and monitoring tools. This role focuses on automation, observability, and maintaining highly available production environments.
Key Responsibilities :
- Manage and maintain Kubernetes-based infrastructure for container orchestration.
- Package, deploy, and manage applications using Helm charts.
- Build and maintain CI/CD pipelines using GitLab CI/CD, Azure DevOps, and Jenkins.
- Automate infrastructure provisioning using Terraform.
- Implement monitoring and observability with tools such as Dynatrace, Datadog, and Grafana.
- Manage log aggregation and analysis using the EFK stack (Elasticsearch, Fluentd, Kibana).
- Ensure system reliability, availability, and performance across Azure and AWS cloud platforms.
- Maintain and optimize database systems such as PostgreSQL and IBM DB2.
- Manage source code repositories using Git.
- Develop automation scripts using Bash.
- Troubleshoot production issues and improve system resilience.
Required Skills :
- Strong experience with Kubernetes and container orchestration.
- Hands-on experience with Helm.
- Expertise in CI/CD tools such as GitLab CI/CD, Azure DevOps, and Jenkins.
- Experience with Infrastructure as Code (Terraform).
- Knowledge of monitoring and observability tools (Dynatrace, Datadog, Grafana).
- Experience with log analytics using the EFK stack.
- Experience working with cloud platforms such as Azure and AWS.
- Familiarity with databases like PostgreSQL and IBM DB2.
- Strong scripting skills in Bash.
- Proficiency in Git for version control.
Preferred Skills :
- Experience in DevOps and SRE practices.
- Knowledge of system reliability, scalability, and high availability architecture.
- Strong troubleshooting and problem-solving abilities.
Posted in : DevOps / SRE
Functional Area : Site Reliability Engineering
Job Code : 1619110