OZI - AI Engineer - LLM/LangChain

OZI TECHNOLOGIES PRIVATE LIMITED
Kolkata
1 - 5 Years

Posted on: 21/07/2025

Job Description

Key Responsibilities :

LLM Deployment & Fine-Tuning :

- Deploy and fine-tune self-hosted large language models (LLMs) such as LLaMA, Mistral, and MPT for use cases like personalized recommendations and age-aware product tagging.

AI Assistants & Search Tools :

- Develop internal AI assistants and intelligent search solutions to enhance user experience and operational efficiency.

RAG Pipelines & Vector Search :


- Design and implement Retrieval-Augmented Generation (RAG) pipelines using LangChain, LlamaIndex, and vector databases such as FAISS, Pinecone, or Weaviate.

Moderation & Compliance :


- Build scalable moderation systems to flag and manage unsafe or non-compliant products effectively.

Cloud-Based Backend Engineering :


- Architect and manage scalable backend systems on AWS using services like Lambda, API Gateway, ECS/Fargate, S3, and CloudWatch to ensure low-latency, auto-scalable AI service delivery.

Model Deployment Automation :


- Set up CI/CD pipelines to automate model deployments, updates, and monitoring for seamless AI operations.

AI API Integration :


- Integrate and manage third-party AI APIs (e.g., OpenAI, Anthropic, Perplexity) and assess performance trade-offs between local and hosted model deployments.

Personalization & Operational Intelligence :


- Build intelligent personalization engines that adapt to a child's age, gender, usage patterns, urgency cues (e.g., diaper restocking), and other contextual signals.

Real-Time Ops Modeling :


- Support logistics and operations with AI models for demand forecasting, ETA predictions, and rider/route optimization.

Required Skills :

- 3-6 years of hands-on experience in AI/ML engineering with a focus on production-grade systems

- Proven track record in deploying and fine-tuning self-hosted LLMs (e.g., LLaMA, Mistral, MPT)

Strong command of :

- Programming & Frameworks: Python, FastAPI, PyTorch, HuggingFace Transformers

- DevOps & Containers: Docker, CI/CD automation

- Cloud Infrastructure: AWS (Lambda, ECS/Fargate, S3, CloudWatch, DynamoDB)

- AI Tooling: LangChain, LlamaIndex, vector databases (FAISS, Pinecone, Weaviate)

- API Integration: Experience working with AI APIs (OpenAI, Claude, Gemini, etc.)

Nice to Have :

- Prior experience in e-commerce, quick-commerce (Q-commerce), or logistics environments

- Familiarity with child safety standards and parenting-oriented product ecosystems

- Knowledge of MLOps tooling (MLflow, Airflow) and GPU infrastructure optimization for inference at scale

