Posted on: 16/04/2026
Description :
We are looking for a highly skilled Site Reliability Engineer (SRE) with strong expertise in GitLab CI/CD to ensure reliability, scalability, and performance of our production systems.
The ideal candidate will work closely with DevOps, engineering, and infrastructure teams to build and maintain robust automation pipelines and cloud-native systems.
Key Responsibilities :
- Design, implement, and manage CI/CD pipelines using GitLab
- Ensure high availability, scalability, and performance of applications and infrastructure
- Monitor system health using observability tools and proactively resolve issues
- Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform
- Implement incident management, root cause analysis (RCA), and postmortems
- Collaborate with development teams to improve system reliability and deployment processes
- Manage containerized applications using Docker and Kubernetes
- Maintain SLAs, SLOs, and error budgets
- Drive continuous improvement in system resilience and automation
Required Skills & Qualifications :
- experience in SRE/DevOps roles
- Strong hands-on experience with GitLab CI/CD pipelines
- Proficiency in scripting languages (Python, Bash, or Go)
- Experience with cloud platforms (AWS, Azure, or GCP)
- Strong knowledge of containerization (Docker, Kubernetes)
- Experience with monitoring tools like Prometheus, Grafana, ELK stack
- Understanding of networking, security, and distributed systems
Preferred Skills :
- Experience with microservices architecture
- Knowledge of chaos engineering and reliability testing
- Familiarity with service mesh (Istio, Linkerd)
- Certification in cloud platforms or Kubernetes is a plus
Education:
- Bachelors/Masters degree in Computer Science, IT, or related field
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1628868