Posted on: 26/07/2025
Company Overview :
Job Responsibilities :
- Lead the architecture and implementation of MLOps/LLMOps systems within OpenShift AI, establishing best practices for scalability, reliability, and maintainability while actively contributing to relevant open source communities.
Required Qualifications :
- 5+ years of software engineering experience, with at least 4 years focusing on ML/AI systems in production environments.
- Strong expertise in Python, with demonstrated experience building and deploying production ML systems.
- Deep understanding of Kubernetes and container orchestration, particularly in ML workload contexts.
- Extensive experience with MLOps tools and frameworks (e.g., KServe, Kubeflow, MLflow, or similar).
- Track record of technical leadership in open source projects, including significant contributions and community engagement.
- Proven experience architecting and implementing large-scale distributed systems.
- Strong background in software engineering best practices, including CI/CD, testing, and monitoring.
- Experience mentoring engineers and driving technical decisions in a team environment.
Preferred Qualifications
- Experience with Red Hat OpenShift or similar enterprise Kubernetes platforms.
- Contributions to ML/AI open source projects, particularly in the MLOps/GitOps space.
- Background in implementing ML model monitoring.
- Experience with LLM operations and deployment at scale.
- Public speaking experience at technical conferences.
- Advanced degree in Computer Science, Machine Learning, or related field.
- Experience working with distributed engineering teams across multiple time zones.
What We Offer
- Health Insurance: We provide medical coverage up to 20 lakh per annum, which covers you, your spouse, and a set of parents.
This is available after one month of successful engagement.
- Professional Development: You'll have access to a monthly upskill allowance of ?5000 for continued education and certifications to support your career growth.
- Leave Policy: Vacation Leave (VL): 10 days per year, available after probation.
You can carry over or encash up to 5 unused days.
Casual Leave (CL): 8 days per year for personal needs or emergencies, available from day one.
Sick Leave: 12 days per year, available after probation.
- Flexible Work Hours or Remote Work Opportunities Depending on the role and project.
- Outsourced Benefits such as Paternity Leave, Maternity Leave, etc.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
ML / DL Engineering
Job Code
1519881
Interview Questions for you
View All