Site Reliability Engineer - Google Cloud Platform

Techolution
Anywhere in India/Multiple Locations
5 - 7 Years

Posted on: 24/09/2025

Job Description

Role : Google Cloud SRE Engineer

We are seeking an exceptional Google Cloud SRE Engineer to join our engineering team.

This role requires a highly skilled professional with deep expertise in Google Cloud Platform (GCP), Kubernetes, Infrastructure as Code, and CI/CD automation.

The ideal candidate thrives in high-pressure production environments, excels at automation, and continuously drives improvements in system reliability, scalability, and operational efficiency.

Title : Google Cloud SRE Engineer.

Location : Remote Work.

Employment Type : Full Time.

No. of Openings : 2.

Timings : 24/7 (rotational shifts).

Key Responsibilities :

- Ensure the reliability, availability, and performance of production systems hosted on GCP.

- Lead incident response and troubleshooting efforts for critical production issues.

- Perform root cause analysis and implement long-term fixes to prevent recurrence.

- Champion monitoring, alerting, and observability practices to enhance system resilience.

Programming & Automation

- Develop and maintain automation tools, scripts, and services using Python, Go, and Bash.

- Identify repetitive operational tasks and convert them into automated workflows.

- Build scalable, robust solutions to reduce operational toil and improve reliability.

Google Cloud Platform (GCP)

- Architect, deploy, and optimize production-grade workloads on GCP.

- Ensure adherence to GCP best practices, cost optimization strategies, and security compliance.

- Continuously evaluate and adopt emerging GCP services to enhance cloud operations.

Kubernetes (GKE)

- Manage and optimize large-scale GKE clusters.

- Implement deployment strategies, resource management, and cluster security.

- Troubleshoot complex issues in containerized workloads and cluster environments.

CI/CD & Infrastructure as Code

- Design, implement, and maintain CI/CD pipelines using Jenkins, GitLab CI, or GitHub Actions.

- Define and manage cloud infrastructure using Terraform, including reusable and modular configurations.

- Collaborate with developers to ensure seamless integration and automated testing.

Required Skills & Experience :

- Programming/Scripting : Expert in Python, Go, and Bash, with a proven automation portfolio.

- GCP : 2+ years of hands-on GCP experience, with a deep understanding of its services and architecture.

- Kubernetes (GKE) : Advanced experience in managing production clusters, deployments, and troubleshooting.

- CI/CD : Strong expertise with Jenkins, GitLab CI, or GitHub Actions; proven history of building enterprise-grade pipelines.

- Terraform : Proficiency in Infrastructure as Code with Terraform, including reusable and modular configurations.

- Incident Response : Demonstrated excellence in handling critical production incidents and performing RCA.

- Automation-First Mindset : Consistent track record of converting manual tasks into automated workflows.

- AI Integration : Awareness of, and experience in applying, AI/ML tools in DevOps practices is a strong plus.

Preferred Qualifications

- GCP Professional Cloud DevOps Engineer or Architect certification.

- Experience with monitoring/observability tools (Prometheus, Grafana, ELK, Stackdriver).

- Exposure to service mesh technologies (Istio, Linkerd).

- Familiarity with security practices such as IAM, workload identity, and secrets management.

