Posted on: 05/11/2025
About the Role :
We are looking for an experienced and proactive Site Reliability Engineer (SRE) to join our team in Bangalore. The ideal candidate will ensure system reliability, scalability, and performance through automation, monitoring, and incident management. This role requires hands-on experience with GCP, Terraform, GitHub, and monitoring tools, along with a strong problem-solving mindset and willingness to learn new technologies.
Key Responsibilities :
- Write and execute basic SQL queries in BigQuery to identify and debug issues.
- Work with GCP infrastructure, with an understanding of its overall architecture, IAM roles, and security principles.
- Manage incidents and changes using ServiceNow, ensuring proper documentation and follow-up.
- Administer and maintain serverless GCP infrastructure, using Terraform and GitHub for code versioning and release management.
- Monitor system health through Splunk and GCP alerts, focusing on early detection of symptoms rather than reactive outage responses.
- Track incidents from initiation to closure, maintaining clear and detailed Root Cause Analyses (RCAs).
- Collaborate with data and analytics teams to troubleshoot data quality issues and validate outputs in Power BI reports.
- Show eagerness to learn and work with new tools such as SSAS and dbt.
- Demonstrate strong teamwork, communication, and accountability in cross-functional environments.
Required Skills & Qualifications :
- Hands-on experience with Google Cloud Platform (GCP) architecture and services.
- Working knowledge of Terraform and GitHub for infrastructure as code and deployment management.
Posted in: DevOps / SRE
Functional Area: Site Reliability Engineering
Job Code: 1570194