- Deploy and fine-tune self-hosted large language models (LLMs) such as LLaMA, Mistral, and MPT for use cases like personalized recommendations and age-aware product tagging.

AI Assistants & Search Tools :

- Develop internal AI assistants and intelligent search solutions to enhance user experience and operational efficiency.

RAG Pipelines & Vector Search :

- Design and implement Retrieval-Augmented Generation (RAG) pipelines using LangChain, LlamaIndex, and vector databases such as FAISS, Pinecone, or Weaviate.

Moderation & Compliance :

- Build scalable moderation systems to flag and manage unsafe or non-compliant products effectively.

Cloud-Based Backend Engineering :

- Architect and manage scalable backend systems on AWS using services like Lambda, API Gateway, ECS/Fargate, S3, and CloudWatch to ensure low-latency, auto-scalable AI service delivery.

Model Deployment Automation :

- Set up CI/CD pipelines to automate model deployments, updates, and monitoring for seamless AI operations.

AI API Integration :

- Integrate and manage third-party AI APIs (e.g., OpenAI, Anthropic, Perplexity) and assess performance trade-offs between local vs. hosted model deployments.

Personalization & Operational Intelligence :

- Build intelligent personalization engines that adapt based on childs age, gender, usage patterns, urgency cues (e.g., diaper restocking), and other contextual signals.

Real-Time Ops Modeling :

- Support logistics and operations with AI models for demand forecasting, ETA predictions, and rider/route optimization.

Required Skills :

- 36 years of hands-on experience in AI/ML engineering with a focus on production-grade systems

- Proven track record in deploying and fine-tuning self-hosted LLMs (e.g., LLaMA, Mistral, MPT)

Strong command of :

- Programming & Frameworks: Python, FastAPI, PyTorch, HuggingFace Transformers

- DevOps & Containers: Docker, CI/CD automation

- Cloud Infrastructure: AWS (Lambda, ECS/Fargate, S3, CloudWatch, DynamoDB)

- AI Tooling: LangChain, LlamaIndex, vector databases (FAISS, Pinecone, Weaviate)

- API Integration: Experience working with AI APIs (OpenAI, Claude, Gemini, etc.)

Nice to Have :

- Prior experience in e-commerce, quick-commerce (Q-commerce), or logistics environments

- Familiarity with child safety standards and parenting-oriented product ecosystems

- Knowledge of MLOps tooling (MLflow, Airflow) and GPU infrastructure optimization for inference at scale

Did you find something suspicious?

Posted By

Shipra Bathla

HR Manager at OZI TECHNOLOGIES PRIVATE LIMITED

Last Active: 5 Dec 2025

Job Views:
1688

Applications: 609

Recruiter Actions: 237

Posted in

AI/ML

Functional Area

ML / DL / AI Research

Job Code

1516512

Jobs by location

Interview Questions for you

View All

Top 25 LLM Interview Questions and Answers

Top 50+ GitHub Interview Questions and Answers

Top 25+ Database Testing Interview Questions and Answers