Description :

We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI/CD automation to lead cloud infrastructure initiatives.

The ideal candidate will design and implement robust CI/CD pipelines, automate deployments, ensure platform reliability, and drive continuous improvement in cloud operations and DevOps practices.

Key Responsibilities :

- Design, develop, and optimize end-to-end CI/CD pipelines using Jenkins, with a strong focus on Declarative Pipeline syntax.

- Automate deployment, scaling, and management of applications across various GCP services including GKE, Cloud Run, Compute Engine, Cloud SQL, Cloud Storage, VPC, and Cloud Functions.

- Collaborate closely with development and DevOps teams to ensure seamless integration of applications into the CI/CD pipeline and GCP environment.

- Implement and manage monitoring, logging, and alerting solutions to maintain visibility, reliability, and performance of cloud infrastructure and applications.

- Ensure compliance with security best practices and organizational policies across GCP environments.

- Document processes, configurations, and architectural decisions to maintain operational transparency.

- Stay updated with the latest GCP services, DevOps, and SRE best practices to enhance infrastructure efficiency and reliability.

Mandatory Skills :

- Google Cloud Platform (GCP) Hands-on experience with core GCP compute, networking, and storage services.

- Jenkins Expertise in Declarative Pipeline creation and optimization.

- CI/CD Strong understanding of automated build, test, and deployment workflows.

- Solid understanding of SRE principles including automation, scalability, observability, and system reliability.

- Familiarity with containerization and orchestration tools (Docker, Kubernetes GKE).

- Proficiency in scripting languages such as Shell, Python, or Groovy for automation tasks.

Preferred Skills :

- Experience with Terraform, Ansible, or Cloud Deployment Manager for Infrastructure as Code (IaC).

- Exposure to monitoring and observability tools like Stackdriver, Prometheus, or Grafana.

- Knowledge of multi-cloud or hybrid environments (AWS experience is a plus)