HamburgerMenu
hirist

Job Description

Job Title : Site Reliability Engineer (SRE)


Experience : 4 to 14 Years


Location : Hyderabad, India


Joining : Immediate Joiners Preferred


Job Summary :


We are seeking an experienced Site Reliability Engineer with a strong background in Observability, Troubleshooting, and IT Platform Management. The ideal candidate will play a critical role in maintaining high system reliability, performance, and scalability while working collaboratively across engineering, operations, and development teams.


Skills Required :


1. Site Reliability Engineering & Service Engineering :


- Strong experience in Observability : Logging, Monitoring, and Alerting (e.g., using tools like Prometheus, Grafana, ELK stack, Datadog, Splunk, etc.)


- Advanced skills in incident management, troubleshooting, and root cause analysis


- Expertise in improving system availability, performance, and reliability in production environments


- Automation of operational tasks and proactive infrastructure monitoring


2. IT Platforms & Hosting, Configuration & Integration :


- Deep understanding of platform usage, performance tuning, and system security requirements


- Implementation of secure hosting environments, ensuring system compliance with best practices


- Experience in infrastructure provisioning and configuration tools (e.g., Terraform, Ansible, Puppet, Chef)


- Ability to integrate and maintain enterprise-level applications and services


Desired Skills :


- Working knowledge of DuckCreek platform


- Proficiency in SQL and relational databases


- Familiarity with Batch Processing systems and schedulers


- Understanding of RESTful APIs and Web Services architecture and troubleshooting


Qualifications :


- Bachelors or Masters degree in Computer Science, Engineering, or related field


- Excellent analytical and problem-solving skills


- Strong communication and collaboration skills

info-icon

Did you find something suspicious?