HamburgerMenu
hirist

Lead Data Scientist - NLP/Artificial Intelligence

Recruiting Bond
6 - 9 Years
Bangalore

Posted on: 10/04/2026

Job Description

Description :


Lead Data Scientist - AI/ML/NLP


- 69 years


Domain ML Leadership Applied Modelling Experimentation Team Mentorship


The Role :


Lead Data Scientists own the ML function for a defined product domain taking business questions from ambiguity through model training through A/B validation and into production.


You are the first person the product team calls when a metric breaks, the engineer who can explain to a sceptical stakeholder why a model made a specific prediction, and the mentor who raises the quality of the scientists around you.


- Own ML for a product domain end-to-end from OKR translation to production deployment


- Lead a pod of 23 Data Scientists; your code reviews are learning experiences, not gatekeeping


- Drive experimentation culture: every change is a hypothesis, every hypothesis gets a test


- Your models are trusted by the product team because your evaluation is airtight


Core Responsibilities :


Model Ownership Across Platforms :


- Formulate ML problems from product briefs: translate 'improve hotel search' into a ranking objective with measurable proxy and business metrics


- Build, evaluate, and maintain ML models for your domain: ranking, recommendation, price prediction, NLP classification, or fraud scoring


- Own feature engineering for your domain: design features, build pipelines, document semantics, monitor for drift


- Manage model debt: schedule retraining, track performance over time, sunset underperforming models before they cause incidents


AI-First Feature Delivery :


Ranking & Relevance: train and deploy L2R models for your vertical; run position bias experiments; evaluate with both offline and online metrics


Price Intelligence: build price sensitivity models per user segment; deliver fare prediction features for the recommendation feed


Recommendation: implement collaborative filtering or content-based models for cross-sell and upsell surfaces in your domain


LLM/RAG contribution: own a specific RAG pipeline (e.g., hotel cancellation policy Q&A); build evaluation datasets and monitor retrieval quality


Sentiment & Reviews: train review summarisation models; build complaint classification systems for support routing in your vertical


Voice AI contribution: build intent classification models for spoken queries in your domain; evaluate ASR output quality on domain vocabulary


Coupon ML: build personalised discount targeting models; measure redemption lift vs. margin cost with causal evaluation


Experimentation & Analytics :


Design A/B tests for ML model changes: power analysis, SRM detection, guardrail metric selection, and post-experiment diagnostics


Build domain-specific evaluation frameworks: what does 'good' look like for hotel ranking? for fare prediction? you define and defend the answer


Produce experiment read-outs that non-technical stakeholders trust and act on


The AI-First Mandate :


- AI is not an enhancement.


- It is the product architecture.


- Every surface, every API, every decision point is either ML-powered today or on the roadmap to be.


- Search & Ranking Learning-to-Rank across flights, hotels, bus routes, train coaches; real-time re-ranking on user signals


- Voice AI Hindi/Hinglish voice booking, intent resolution, spoken fare comparisons, accessibility-first conversational UX


- RAG Systems - Fare rule retrieval, hotel cancellation policy Q&A, airline contract intelligence, real-time regulatory updates


- Agentic AI - Autonomous booking resolution, exception handling, refund orchestration, supplier communication bots


- MCP Orchestration - Model Context Protocol tool chains across GDS APIs, payment gateways, and supplier integrations


- Recommendation Engine - Cross-vertical next-best-action, collaborative filtering, session-based deep learning


- Price Intelligence - Competitive fare mapping, lower-price guarantee engine, demand elasticity, yield optimisation


- Coupon & Promo ML - Personalised offer targeting, redemption probability scoring, margin-aware discount optimisation


- Sentiment & Review AI - Review summarisation, NPS prediction, complaint triage, trust signal extraction


- Fraud & Risk ML - Anomaly detection, account takeover signals, payment fraud scoring, fake review classification


- Deep System Mapping - Route intelligence, geo-semantic matching, multi-modal journey planning


- Predictive Systems - Cancellation risk, no-show prediction, seat upgrade probability, waitlist conversion


Who You Are :


- 69 years in Data Science/ML with at least 2 end-to-end production deployments from problem definition through monitoring


- Strong in ML fundamentals: you can train a ranking model, run a causal experiment, and explain the result to an engineering director


- Working NLP experience: text classification, embeddings, or fine-tuning LLM experience is a strong plus


- Comfortable leading a small technical pod: delegating work, giving feedback, and holding the quality bar without micromanaging


- Tier-I institute preferred (IIT / IIIT / NIT / IISC / BITS CSE / Statistics / AI)


Technology Stack :


ML/DL : Scikit-learn PyTorch XGBoost LightGBM CatBoost


NLP/LLM : HuggingFace Transformers spaCy LangChain Sentence Transformers


Data Eng : Python SQL Spark Airflow dbt Feast


Serving & Exp : FastAPI MLflow Docker Weights & Biases

info-icon

Did you find something suspicious?

Similar jobs that you might be interested in