Posted on: 19/09/2025
About Docsumo :
Docsumo is a Document Workflow platform that converts unstructured documents (like bank statements, financials, policies) into structured, actionable data with the help of Agentic Workflows.
Were backed by Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars and trusted by leading banks, insurers, and fintechs worldwide.
The opportunity as Senior DevOps / SRE Engineer :
Were looking for a Senior SRE (Python) to lead a small team (2 engineers) and own the reliability, deployment, and automation of our AI platform.
Youll work hands-on with Kubernetes, GCP, AWS, Python (Flask/FastAPI) and ensure our infrastructure and applications run securely, reliably, and at scale.
Key Responsibilities :
- Lead SRE initiatives and mentor 2 junior engineers
- Own deployments and monitoring across GCP (K8s, Cloud Run, VPC, networking) and AWS (Lambda, SES)
- Debug & fix issues in Python apps (Flask, FastAPI), with occasional Lua for canary deployments
- Set up automation, infra-as-code, CI/CD pipelines, and incident response
- Optimize for cost, performance, and reliability across infra and applications
- Work closely with backend engineers, product, and operations to keep our services running smoothly
Need to have :
- 4+ years in SRE/DevOps with strong Python scripting & backend debugging skills
- Hands-on with Kubernetes, Docker, and cloud infra (GCP & AWS)
- Experience with MongoDB, Elastic, monitoring tools (Prometheus, Grafana)
- Strong troubleshooting, debugging, and problem-solving skills
- Ability to lead small teams and drive reliability culture
Nice to have :
- Experience with Temporal, Redis, or serverless (Cloud Run, Lambda)
- Exposure to high-traffic SaaS or AI/ML infrastructure
- Prior team leadership/mentorship experience
Why join us?
- Lead the SRE charter and shape reliability for our platform
- Work on modern infra (K8s, Cloud-native, Temporal, serverless)
- High ownership, visible impact report directly to Engineering leadership
- Opportunity to grow into Principal Engineer / SRE Manager
- Fast-paced startup, strong learning curve, and a collaborative culture
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1549091
Interview Questions for you
View All