Posted on: 16/05/2025
Job Description :
We are looking for an exceptional Data Scientist with deep expertise in speech technologies, advanced NLP, and LLM fine-tuning to join our cutting-edge AI research team. In this pivotal role, you will be responsible for building and optimizing state-of-the-art machine learning pipelines that drive intelligent audio and language-based products.
Your work will directly contribute to the development of next-generation AI solutions that are privacy-focused, high-performance, and built for scale.
Key Responsibilities :
- Design and implement robust intent detection and entity extraction systems, utilizing transcribed speech, keyword spotting, and semantic pattern recognition.
- Fine-tune LLMs and transformer architectures (BERT, RoBERTa, etc.) for tasks including intent classification, entity recognition, and contextual comprehension.
- Optimize end-to-end pipelines for mobile and on-device inference, employing tools like TFLite, ONNX, quantization, and pruning to achieve low-latency performance.
- Collaborate closely with AI product teams and MLOps engineers to ensure seamless deployment, continuous iteration, and performance monitoring.
Required Technical Skills :
- Strong command of NLP techniques such as keyword spotting, sequence labeling, masked token prediction, and rule-based classification.
- Proven track record in LLM and transformer fine-tuning for NER, intent detection, and domain-specific adaptation.
- Expertise in speech metadata extraction, feature engineering, and signal enrichment.
- Proficiency in model optimization methods like quantization-aware training (QAT), pruning, and efficient runtime deployment for edge devices.
- Excellent Python skills with proficiency in PyTorch or TensorFlow, along with solid experience in NumPy, pandas, and real-time data processing frameworks.
Qualifications :
- Academic or industry background in speech processing, ASR, telecom analytics, or applied NLP is highly desirable.
- Portfolio showcasing real-world speech/NLP projects, open-source contributions, or published research will be a strong advantage.
Experience :
- Proven success in building, deploying, and optimizing ML models for real-time, low-latency environments.
- Contributions to leading open-source projects like openai/whisper, mozilla/DeepSpeech, or facebook/wav2vec2 are highly valued.
Did you find something suspicious?
Posted By
Sumandeep Tuteja
Last Login: NA as recruiter has posted this job through third party tool.
Functional Area
Mobile Development - iOS
Job Code
1480608
Interview Questions for you
View All