HamburgerMenu
hirist

Site Reliability Engineer - Docker/Kubernetes

It's All About People Consulting OPC Pvt Ltd.
Bangalore
2 - 8 Years

Posted on: 21/09/2025

Job Description

About the Role :

You will be part of a team that helps develop secure and scalable infrastructure. Together with the other members of the team, you will be in charge of managing our Kubernetes clusters running in AWS, as well as various additional infrastructure, such as databases, logging, monitoring, networking and CI/CD pipeline.

You will also be helping our developers, by improving their build systems, deployment systems, and the custom tooling that is needed.

Finally you will be building tools and helping out the developers deliver value faster.


Essential Competencies :

- Collaborative Problem-Solving : Actively engage with team members to diagnose and resolve technical issues.

- Adaptive Learning : Demonstrate the ability to rapidly acquire knowledge and skills in unfamiliar areas to address emerging challenges.

- Proactive System Enhancement : Independently identify and present improvements for our roadmap that will optimize existing technological infrastructures.

- Proficiency in Linux Administration : Extensive experience with Debian systems.

- Kubernetes Expertise : Skilled in Kubernetes orchestration, preferably with Kops experience.

- Docker and AWS Proficiency : Solid background in utilizing Docker in AWS environments.

- Networking Acumen : Strong understanding and experience in network technologies and protocols.

- Alerting and Monitoring Skills : Competence in developing and managing monitoring and

alerting systems.

- Programming Skills : Proficiency in one or more relevant programming languages such as

Python or Go.


Desirable Additional Skills :


- Experience with Unified Logging : Familiarity with Graylog or similar platforms for log

management.

- Distributed Systems Knowledge : Understanding of the complexities and management of

distributed computing environments.

- Vulnerability Management : Experience in vulnerability scanning and monitoring.

- Prometheus for System Monitoring : Proficiency in using Prometheus for monitoring system

performance.

- CI/CD Expertise : Experience with Continuous Integration and Continuous Deployment

systems, such as CircleCI and ArgoCD.

- GitOps and Infrastructure as Code (IaC) : Familiarity with GitOps principles and IaC practices.


info-icon

Did you find something suspicious?