Posted on: 21/04/2026
Description :
We are looking for an experienced Site Reliability Engineer (SRE) to join a high-impact team working on mission-critical infrastructure. The role involves managing and optimizing on-premise systems, ensuring high availability, system reliability, and performance in a fast-paced production environment.
Key Skills Required :
- Strong experience in Linux Administration
- Hands-on experience with Docker (Non-Kubernetes / Bare Metal)
- Good knowledge of Nginx, Redis, and Kafka
- Experience with Jenkins, Git, Ansible, and Shell Scripting
- Familiarity with ELK Stack, Grafana, and Prometheus
- Good understanding of Networking, Firewalls / WAF, and Akamai
- Experience in Monitoring, Incident Management, RCA, and Capacity Planning
- Working knowledge of RDBMS and Java
Good to Have :
- Experience in Payments / UPI domain
- Exposure to High TPS systems
- Understanding of Low Latency Architecture
- Knowledge of HA / DR design patterns
- Familiarity with PCI-DSS / RBI guidelines
Core Skills : Linux Administration | Docker | Nginx | Redis | Kafka | Jenkins | Git | Ansible | Shell Scripting | ELK | Grafana | Prometheus | Networking | Firewalls/WAF | Akamai | RCA | Capacity Planning | RDBMS | Java
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1629892