Posted on: 31/07/2025
Job Title : Site Reliability Engineer (SRE)
Location : Coimbatore, Pune
Interview Mode : 2 rounds (F2F)
Department : Technology / Infrastructure / DevOps
Employment Type : Full-time
Job Summary :
We are seeking an experienced Site Reliability Engineer (SRE) who will play a critical role in ensuring the reliability, performance, and scalability of our payment systems.
The ideal candidate will possess deep expertise in DevOps automation, enterprise monitoring, and cloud platforms, along with a strong background in Card Payment systems.
This role requires hands-on technical skills, a passion for problem-solving, and the ability to collaborate across teams in a fast-paced, dynamic environment.
Key Responsibilities :
Reliability & Performance :
- Drive root cause analysis (RCA) and implement long-term solutions to prevent recurrence of incidents.
- Manage capacity planning, scalability, and performance tuning across cloud and on-prem environments.
- Lead and participate in the on-call rotation, providing timely support and issue resolution.
DevOps Automation & CI/CD :
- Automate infrastructure deployment, configuration, and monitoring, following Infrastructure as Code (IaC) principles.
- Enhance automation for routine operational tasks, incident response, and self-healing capabilities.
Monitoring & Observability :
- Implement and manage enterprise monitoring solutions including Splunk, Dynatrace, Prometheus, and Grafana.
- Build real-time dashboards, alerts, and reporting to proactively identify system anomalies.
- Continuously improve observability, logging, and tracing across all environments.
Cloud Platforms & Infrastructure :
- Work with AWS, Azure, and PCF (Pivotal Cloud Foundry) environments, managing cloud-native services and infrastructure.
- Design and optimize cloud architecture for reliability and cost-efficiency.
- Collaborate with cloud security and networking teams to ensure secure and compliant infrastructure.
Payment Systems Expertise :
- Apply your understanding of Card Payment systems to ensure platform reliability and compliance.
- Troubleshoot payment-related issues, ensuring minimal impact on transaction flows and customer experience.
- Collaborate with product and development teams to ensure alignment with business objectives.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1522006
Interview Questions for you
View All