HamburgerMenu
hirist

Rosemallow Technologies - Site Reliability Engineer - Observability Services

ROSEMALLOW TECHNOLOGIES PRIVATE LIMITED
Coimbatore
3 - 5 Years

Posted on: 31/07/2025

Job Description

Job Title : Site Reliability Engineer (SRE)

Location : Coimbatore, Pune

Interview Mode : 2 rounds (F2F)

Department : Technology / Infrastructure / DevOps

Employment Type : Full-time

Job Summary :

We are seeking an experienced Site Reliability Engineer (SRE) who will play a critical role in ensuring the reliability, performance, and scalability of our payment systems.

The ideal candidate will possess deep expertise in DevOps automation, enterprise monitoring, and cloud platforms, along with a strong background in Card Payment systems.

This role requires hands-on technical skills, a passion for problem-solving, and the ability to collaborate across teams in a fast-paced, dynamic environment.

Key Responsibilities :

Reliability & Performance :


- Ensure the reliability, availability, and performance of critical payment platforms and services.

- Drive root cause analysis (RCA) and implement long-term solutions to prevent recurrence of incidents.

- Manage capacity planning, scalability, and performance tuning across cloud and on-prem environments.

- Lead and participate in the on-call rotation, providing timely support and issue resolution.

DevOps Automation & CI/CD :


- Design, implement, and maintain CI/CD pipelines using Jenkins, GitHub, and other DevOps tools.

- Automate infrastructure deployment, configuration, and monitoring, following Infrastructure as Code (IaC) principles.

- Enhance automation for routine operational tasks, incident response, and self-healing capabilities.

Monitoring & Observability :

- Implement and manage enterprise monitoring solutions including Splunk, Dynatrace, Prometheus, and Grafana.

- Build real-time dashboards, alerts, and reporting to proactively identify system anomalies.

- Continuously improve observability, logging, and tracing across all environments.

Cloud Platforms & Infrastructure :

- Work with AWS, Azure, and PCF (Pivotal Cloud Foundry) environments, managing cloud-native services and infrastructure.

- Design and optimize cloud architecture for reliability and cost-efficiency.

- Collaborate with cloud security and networking teams to ensure secure and compliant infrastructure.

Payment Systems Expertise :

- Apply your understanding of Card Payment systems to ensure platform reliability and compliance.

- Troubleshoot payment-related issues, ensuring minimal impact on transaction flows and customer experience.

- Collaborate with product and development teams to ensure alignment with business objectives.


info-icon

Did you find something suspicious?