Posted on: 19/08/2025
Job Title : Principal Machine Learning Architect, Speech & Multimodal AI.
Experience : 10+ years.
Location : Remote.
Type : Contract.
About the Role :
We are seeking a highly experienced Principal Machine Learning Architect with deep expertise in Speech, NLP, Computer Vision, and Multimodal AI.
This role requires strong leadership in designing, building, and optimizing advanced ML architectures, with a focus on speech recognition (ASR), text-to-speech (TTS), and Large Language Models (LLMs) for real-world multimodal applications.
Key Responsibilities :
- Architect, design, and optimize ML models in NLP, Computer Vision, Speech Recognition, and TTS.
- Build and fine-tune Transformer and LLM-based models for speech and multimodal use cases.
- Create and deploy a Speech LLM-based virtual assistant solution.
- Develop scalable ML pipelines for data preparation, feature engineering, training, and evaluation.
- Work on audio signal processing including encoding, decoding, modality fusion, and LLM interaction.
- Implement automated training, deployment, and monitoring workflows using MLOps frameworks.
- Collaborate with cross-functional teams to deliver production-grade AI solutions.
Required Skills & Qualifications :
- 10+ years of experience in Machine Learning, Deep Learning, or AI, with a strong focus on Speech and NLP.
- Proven expertise in ASR (Automatic Speech Recognition), TTS (Text-to-Speech), Multimodal AI, and Audio Signal Processing.
- Deep understanding of Transformer Architectures and LLM fine-tuning.
- Strong programming skills in Python, with hands-on experience in PyTorch and/or TensorFlow.
- Experience with Docker, Kubernetes, Airflow, and serverless computing.
- Prior experience designing automated model pipelines and implementing MLOps best practices.
- Strong problem-solving, leadership, and communication skills.