Posted on: 03/02/2026
Description :
We are hiring a Senior NLP / AI Engineer to build and scale production-grade AI systems across multilingual NLP, speech, LLM, and RAG use cases for enterprise deployments.
Location : Noida (Hybrid)
Experience : 4+ years (Production ML / NLP)
Job Type : Full-Time
Responsibilities :
- Build multilingual systems supporting 20+ Indian languages.
- Optimize models using quantization, pruning, and inference acceleration.
- Develop real-time and batch pipelines for speech, translation, and RAG workloads.
- Architect and deploy RAG pipelines, embedding models, and vector databases.
- Manage and optimize GPU infrastructure on AWS and GCP.
- Deploy and maintain high-availability AI APIs with monitoring and failover.
Required Skills :
- 4+ years of experience in production NLP / ML systems.
- Strong hands-on experience with Python, PyTorch, and TensorFlow.
- Experience with Speech-to-Text, Text-to-Speech, Transformer architectures, and open-source LLMs.
- Hands-on experience with CTranslate2, Faster-Whisper, ONNX Runtime, vLLM / SGLang, and LLM quantization (INT4 / INT8 / AWQ).
- Experience building RAG systems using FAISS or Pinecone.
- Strong Linux, Docker, and FastAPI experience.
- Hands-on experience managing GPU infrastructure on AWS / GCP.
- Strong multilingual NLP experience, especially with Indian languages.
Good to Have :