Key Responsibilities :
LLM Deployment & Fine-Tuning :
- Deploy and fine-tune self-hosted large language models (LLMs) such as LLaMA, Mistral, and MPT for use cases like personalized recommendations and age-aware product tagging.
AI Assistants & Search Tools :
- Develop internal AI assistants and intelligent search solutions to enhance user experience and operational efficiency.
RAG Pipelines & Vector Search :
Requirements :
- 3-6 years of hands-on experience in AI/ML engineering with a focus on production-grade systems
- Proven track record in deploying and fine-tuning self-hosted LLMs (e.g., LLaMA, Mistral, MPT)
Strong command of :
- Programming & Frameworks: Python, FastAPI, PyTorch, HuggingFace Transformers
- DevOps & Containers: Docker, CI/CD automation
- Cloud Infrastructure: AWS (Lambda, ECS/Fargate, S3, CloudWatch, DynamoDB)
- AI Tooling: LangChain, LlamaIndex, vector databases (FAISS, Pinecone, Weaviate)
- API Integration: Experience working with AI APIs (OpenAI, Claude, Gemini, etc.)
Nice to Have :
- Prior experience in e-commerce, quick-commerce (Q-commerce), or logistics environments
- Familiarity with child safety standards and parenting-oriented product ecosystems
- Knowledge of MLOps tooling (MLflow, Airflow) and GPU infrastructure optimization for inference at scale