HamburgerMenu
hirist

Job Description

About The Company :

Ara's Client is a B2B SaaS based company that helps online retailers and marketplaces make more business by using AI-powered advertising and marketing tools. The company was founded in 2016 and is based in San Francisco.


The Role :


We are seeking a highly skilled Staff DevOps Engineer to architect and maintain a highly available, global infrastructure capable of handling high QPS systems with 99.99% uptime. The role requires expertise in managing deployments across multiple regions, ensuring fault-tolerant systems, and driving scalability for mission-critical applications.


Key Responsibilities :


- Architect, manage, and scale Kubernetes clusters for high throughput and low latency across multiple global regions.

- Design and maintain Infrastructure as Code (IaC) to support a fault-tolerant, globally distributed architecture.

- Build and optimize CI/CD pipelines to ensure smooth, zero-downtime deployments.

- Ensure 99.99% availability for high QPS applications by implementing robust monitoring, incident management, and failover strategies.

- Manage multi-region deployments to enable low-latency, geo-redundant infrastructure.

- Collaborate with cross-functional teams to ensure security, scalability, and operational efficiency.

- Lead and mentor a high-performing DevOps team, fostering a culture of excellence and innovation.


Skills Required :


- 7 to 10 years of experience managing large-scale, high-availability systems.

- Experience in B2B SAAS Company is a must.

- Proven expertise in Kubernetes administration, including multi-region deployments and scaling for high QPS.

- Deep experience with IaC tools like Terraform or CloudFormation.

- Hands-on with CI/CD pipelines for global, multi-region deployments.

- Strong understanding of cloud platforms (AWS, GCP, or Azure) and geo-redundant architecture.

- Proficient in Linux, scripting (Bash, Python), and troubleshooting large-scale distributed systems.

- Experience leading teams and solving complex, production-grade system challenges.


Qualifications & Experience :


- 6 to 12 years of experience managing large-scale, high-availability systems.

- Education: BE, BCA, Btech


info-icon

Did you find something suspicious?