Posted on: 08/12/2025
Job Summary :
We're seeking an experienced CloudOps Engineer III to help evolve and strengthen Urbint's cloud infrastructure and reliability practices. You'll be part of a high-performing Site Reliability Engineering (SRE) and Cloud Operations team responsible for building and maintaining hybrid cloud environments with a focus on uptime, performance, security, and cost efficiency. This is a hands-on, technical role ideal for someone who thrives on automation, scalability, and solving complex infrastructure challenges.
What You'll Do :
- Cloud Infrastructure & Operations: Build, maintain, and scale reliable cloud systems across AWS, GCP, and hybrid environments.
- Infrastructure as Code (IaC): Design and implement repeatable, automated infrastructure deployments using Terraform, Kubernetes, and CI/CD pipelines.
- Monitoring & Incident Response: Develop and maintain monitoring, alerting, and observability using Prometheus, Grafana, and Datadog. Participate in incident response, troubleshooting, and root cause analysis.
- System Reliability & Performance: Partner with engineering teams to improve system resilience, scalability, and performance of distributed microservices.
- Security, Governance & Compliance: Ensure infrastructure meets compliance and security standards (SOC 2, ISO 27001, HIPAA). Implement IAM, encryption, and network security best practices.
- Cost Optimization (FinOps): Analyze cloud resource usage and collaborate with teams to optimize cost and performance.
- Collaboration: Work closely with Product, Engineering, and Security to align infrastructure needs with business goals.
- Continuous Improvement: Contribute to infrastructure design discussions, propose improvements, and champion DevOps best practices across the organization.
Who You Are :
Experience :
- 6-9 years in CloudOps, SRE, or DevOps roles supporting large-scale distributed systems.
- Deep hands-on experience with AWS, GCP, or Azure (minimum 4 years).
- 3+ years of practical experience with Kubernetes, Docker, Terraform, and CI/CD tools.
Technical Skills :
- Strong scripting and automation skills in Python, Go, or Shell.
- Experience with monitoring and logging tools (Prometheus, Grafana, Datadog, ELK, etc.).
- Familiarity with distributed systems, microservices, and cloud-native architectures.
Governance & Compliance :
- Understanding of IAM, VPCs, encryption, and security best practices.
- Exposure to regulated environments (SOC 2, HIPAA, ISO 27001).
Mindset :
- Self-driven, detail-oriented, and passionate about operational excellence.
- Comfortable in collaborative, cross-functional environments.
- Curious, continuously learning, and always seeking to improve systems and processes.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1586479