Posted on: 06/01/2026
Description :
DevOps / SRE Engineer (4 Years Experience)
Location : Bangalore (Onsite) | Type : Full-time
About Cautio (Bangalore) :
We are looking for a DevOps / Site Reliability Engineer (SRE) who can take ownership of our cloud infrastructure and bring expertise in building highly reliable systems following SRE best practices.
Key Responsibilities :
- Design, build, and maintain Infrastructure as Code (IaC) using Terraform for consistent and scalable environments.
- Develop, monitor, and improve AWS-based infrastructure including API Gateway, ECS, EC2, S3, RDS, and Event
Bridge.
- Create and manage robust CI/CD pipelines for seamless application deployment and rollbacks.
- Implement and manage containerized workloads with Docker and orchestrate them using Kubernetes (basic to intermediate level).
- Apply SRE best practices : Define and measure SLIs, SLOs, and error budgets; set up monitoring, logging, and
alerting; drive incident response and postmortems; automate operational tasks.
- Work with development teams to enable high availability, scalability, and security for applications.
- Apply strong Linux administration and networking knowledge to troubleshoot and optimize systems.
- Continuously analyze system performance and recommend improvements to ensure optimal performance and cost efficiency.
Required Skills & Experience :
- Proficiency in Terraform for infrastructure provisioning.
- Strong expertise with AWS services (API Gateway, ECS, EC2, S3, RDS, EventBridge).
- Experience implementing and maintaining CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI/CD, etc.).
- Good understanding of Linux systems and networking fundamentals (TCP/IP, DNS, VPNs).
- Hands-on with Docker and Kubernetes (basic/intermediate level) for container orchestration.
- Practical knowledge of SRE principles : SLIs/SLOs, error budgets, reliability engineering, incident management.
- Excellent debugging, troubleshooting, and problem-solving skills.
- Self-starter with the ability to take ownership and work in a dynamic, fast-paced environment.
- Mandatory : Must be willing to work onsite at our Bangalore office.
Nice-to-Have (Good to Bring, Not Mandatory) :
- Experience with observability tools (Prometheus, Grafana, Datadog, ELK).
- Knowledge of DevSecOps practices for integrating security into CI/CD and infrastructure.
- Scripting experience in Python or Shell for automation.
- Exposure to designing high availability and disaster recovery strategies.
This is a hands-on role where youll have the opportunity to directly influence how infrastructure and reliability are built for scale in a startup environment.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1597324