HamburgerMenu
hirist

DevOps/Infrastructure Engineer

Sugary
5 - 8 Years
Multiple Locations

Posted on: 15/04/2026

Job Description

Description :

- Observability & Reliability

- Implement monitoring, logging, and alerting using Prometheus + Grafana

- Debug production issues under pressure


About the Role :


Were hiring a founding DevOps / Infrastructure Engineer (IC) to build and scale our platform from scratch. You will own everything from first deployment


100K+ RPS scale, balancing speed, reliability, cost, and security.


This is a hands-on, high-ownership role - you will define the infrastructure, not just maintain it.


Build Infrastructure from Zero :

- Architect and deploy systems on Amazon Web Services / Google Cloud Platform

- Set up networking, compute, storage, IAM, and environments (dev/staging/prod)


Own CI/CD & Developer Experience :

- Build fast, reliable pipelines using GitHub Actions or similar

- Enable multiple deployments per day with zero downtime


Containerization & Runtime :

- Containerize services using Docker

- Decide when to introduce Kubernetes (and when not to)


Infrastructure as Code :

- Define reproducible infra using Terraform


Scale Systems (10K - 100K RPS) :

- Design for high throughput and low latency (<50ms)

- Introduce caching, async processing, and autoscaling

- Optimize system bottlenecks (DB, network, compute)


Security & Compliance :


- Implement authentication, authorization, and secrets management


- Build systems aligned with banking-grade principles (encryption, audit logs, least privilege)


Must-Have :

Strong fundamentals in :

- Linux, networking, HTTP

- Distributed systems basics (e.g. CAP theorem)

- Experience with at least one cloud platform

- Hands-on with CI/CD, containers, and infrastructure automation

- Ability to design systems, not just use tools


High-Impact Traits :


- You optimize for simplicity first, then scale


- You understand tradeoffs (latency vs consistency vs cost)

- You can debug real production failures

- You take full ownership of uptime and performance


Nice to Have :

Experience with :

- Distributed databases (e.g. Apache Cassandra, MongoDB)

- High-scale systems (10K+ RPS)

- Fintech / payments / security-sensitive systems

- Ship infrastructure that enables fast product iteration

- Scale from 0 ? 100K RPS without major rewrites

- Maintain high uptime and reliability

- Keep infrastructure costs efficient

- Build secure, production-grade systems


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in