HamburgerMenu
hirist

Rapid Alpha - Senior Platform Architect - Reliability Engineering

Rapid Alpha
6 - 12 Years
Chandigarh

Posted on: 06/03/2026

Job Description

About Rapid Alpha Remote (India) :

Rapid Alpha is building EVOS, an AI-enabled execution platform used to analyze markets, technologies, and innovation strategies for mid-sized companies.

Our platform processes large document sets, performs AI-driven classification, and supports multi-tenant client workloads.

As adoption grows, the platform must scale reliably under concurrent data processing and AI workloads.

This role exists to ensure the system holds under growth.

You will design and maintain the infrastructure that allows EVOS to scale without sacrificing reliability, performance, or operational discipline.

This is not a traditional DevOps role.

This is a platform architecture role responsible for system reliability, scalability, and infrastructure design.

Role & Responsibilities :

You will own the stability and scalability of the EVOS platform.

Platform Architecture :

- Design and evolve the cloud infrastructure supporting EVOS

- Ensure the platform scales safely under increasing concurrent workloads

- Architect reliable multi-tenant SaaS infrastructure

Workload Isolation & Job Processing :

- Design queue-based systems for heavy background workloads

- Ensure compute-heavy processes do not degrade application performance

- Implement reliable asynchronous job processing

Reliability Engineering :

- Maintain high availability and system uptime

- Implement observability, monitoring, and alerting

- Diagnose and resolve performance bottlenecks

Infrastructure Discipline :

- Introduce best practices for deployment, monitoring, and scaling

- Improve system resilience under real production conditions

- Establish infrastructure patterns that support long-term platform growth

You will work directly with the Principal AI Systems Architect, engineers, and company leadership to ensure the platform scales as Rapid Alpha grows.

The System You Will Work On :

EVOS currently operates on :

- AWS cloud infrastructure

- Laravel-based SaaS platform

- Python services supporting AI workloads

- PostgreSQL database

- Multi-tenant architecture

- Document processing and classification pipelines

- The platform already supports active clients and real workloads. Your role is to professionalize and scale the infrastructure that supports it.

What Success Looks Like :

Success in this role means :

- The platform remains stable under increasing client workloads

- AI workloads run asynchronously without impacting application performance

- Infrastructure scales predictably as the platform grows

- Observability allows rapid diagnosis of issues

- Reliability improves as usage increases

Your work directly impacts client reliability, platform growth, and company profitability.

Preferred Candidate Profile :

We are looking for engineers who have designed and operated production systems at scale, not just managed deployment pipelines.

Minimum qualifications :

- 6- 10+ years of software or infrastructure engineering experience

- Strong experience with AWS-based cloud infrastructure

- Experience designing distributed or microservices-based systems

- Experience implementing queue-based or asynchronous workloads

- Experience supporting production SaaS systems

Experience with technologies such as the following is valuable but not required :

- AWS, Kubernetes


- Docker


- Redis


- Kafka


- Terraform


- Prometheus


- Grafana


- PostgreSQL, Python, or similar infrastructure tools.

We care more about system design, reliability thinking, and ownership than specific tools.

This Role Is Best Suited For Engineers Who :

- Enjoy solving systems and infrastructure problems

- Have owned production reliability in real systems

- Are comfortable designing systems that support real workloads

- Prefer high ownership environments over narrowly defined roles

This Role Is Not A Fit If :

- Your experience is primarily CI/CD pipeline management

- Your work has focused mainly on deployment automation

- You have not been responsible for production infrastructure decisions

This role requires engineers who are comfortable owning system reliability and scalability.

Hiring Timeline :

Rapid Alpha is actively hiring for this role and expects to move quickly.

Qualified candidates can expect :

- Initial screening conversation within a few days of applying

- Technical discussion with the engineering team

- Final interview with company leadership

Our goal is to extend an offer by March 20 for the right candidate.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in