Posted on: 08/04/2026
Responsibilities :
- Architect multi-tenant isolation models (tenant-per-cluster, namespace, and logical isolation).
- Design control plane + data plane separation for SaaS.
- Build tenant onboarding, provisioning, and lifecycle management.
- Implement scalable ingestion pipelines (1M+ events/day, streaming via Pulsar/Kafka).
- Design multi-tenant graph architecture (Neo4j) and storage strategies.
- Ensure RBAC, tenant-level security, and data isolation (IAM, Vault, encryption).
- Optimize cost, performance, and autoscaling across tenants.
- Enable SaaS observability, billing, metering, and usage tracking.
- Work on multi-cloud deployment (AWS, Azure, GCP) with portability.
Requirements :
- Strong experience in distributed systems and SaaS architectures.
- Experience in coding Charts, Terraform Scripts, Bash
- Deep expertise in Kubernetes (multi-tenant patterns, operators, scaling).
- Experience with event-driven systems (Kafka / Pulsar).
- Hands-on with databases: Neo4j, Postgres, time-series/log systems.
- Knowledge of multi-tenant security models (isolation, encryption, IAM).
- Experience building control planes / platform engineering systems.
- Familiarity with Terraform / IaC and cloud networking.
- Exposure to observability (MELT) and large-scale systems.
Good to Have :
- Experience with graph-based systems or knowledge graphs.
- Exposure to AI/LLM-integrated platforms.
- Prior work on DevOps / SRE / observability platforms.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1626736