Job Description

Location : Gurgaon (Hybrid).

About the Company :


We are a fast-growing messaging platform enabling real-time chat, group messaging, voice, video, and notifications for millions of users.

Our systems deliver high throughput, low latency, and 24/7 availability at scale.

Role Overview :


We are looking for a Senior DevOps Engineer to build, scale, and operate highly available infrastructure for our messaging platform.


You will work closely with Engineering, SRE, and Security teams to ensure reliability, performance, and fast deployments.

Key Responsibilities :


Infrastructure & Cloud :


- Design, deploy, and maintain scalable infrastructure on AWS / GCP / Azure.

- Manage compute, networking, storage, and security for high-traffic systems.

- Optimize infrastructure for low latency and high availability.

CI/CD & Automation :


- Build and maintain robust CI/CD pipelines.

- Automate deployments, rollbacks, and environment provisioning.

- Improve developer productivity through tooling and automation.

Reliability & Observability :


- Ensure 99.9%+ uptime for messaging services.

- Set up monitoring, logging, and alerting (Prometheus, Grafana, ELK, Datadog, etc.).

- Perform root cause analysis (RCA) and incident management.

Scalability & Performance :


- Support real-time workloads (chat, presence, notifications).

- Implement load balancing, autoscaling, and failover strategies.

Security & Compliance :


- Implement security best practices (IAM, secrets management, encryption).

- Ensure secure deployments and access controls.

- Support compliance and data protection requirements.

Required Skills & Qualifications :


- 5+ years of experience in DevOps / SRE.

- Strong experience with Linux, Bash, and scripting.

- Expertise in Docker and Kubernetes.

- Experience with Terraform / CloudFormation / IaC tools.

- Hands-on experience with cloud platforms (AWS/GCP/Azure).

- Strong understanding of networking, DNS, and load balancing.

- Experience managing high-scale distributed systems.

Good to Have :


- Experience with messaging / chat / real-time systems.

- Exposure to Kafka, RabbitMQ, Redis.

- Experience with Erlang / Go / Java backend platforms.

- Knowledge of zero-downtime deployments.

- Familiarity with SRE practices (SLIs, SLOs, error budgets).

Key Systems You'll Work With :

- Messaging servers & real-time gateways.

- Databases & caches.

- CI/CD pipelines.

- Monitoring & alerting platforms.

- Security and access management.

