Posted on: 24/11/2025
Description :
We are looking for a highly skilled Real-Time Voice AI Engineer with hands-on experience in building real-time voice bots, speech pipelines, and streaming AI systems using open-source technologies. This role involves working across the stackspeech recognition, voice generation, WebRTC streaming, ML deployment, and scalable Python microservices.
Key Responsibilities :
Real-Time Voice & Streaming Systems :
- Develop and maintain real-time voice bots using WebRTC, LiveKit, WebSockets, and telephony integrations.
- Build and optimize low-latency voice streaming pipelines for conversational AI applications.
- Ensure secure, robust, and scalable communication channels across all voice systems.
Speech & Generative AI Pipelines :
- Build ASR pipelines using Whisper, VAD, turn detection, and custom inference modules.
- Implement, fine-tune, and deploy Generative AI models using Ollama, vLLM, HuggingFace, etc.
- Optimize TTS streaming using open-source models such as Orpheus, Spark, Fish Audio, or similar.
Backend & ML Infrastructure :
- Develop Python-based inference microservices and MLOps pipelines for scalable deployment.
- Optimize models via quantization, caching, and GPU-aware serving.
- Ensure real-time performance with low-latency audio, fast inference, and efficient resource utilization.
Required Skills & Experience :
- 3+ years of relevant experience in real-time AI/voice streaming development (must be open-source focused).
- Strong proficiency in Python, including multiprocessing, async programming, and microservice architecture.
- Strong understanding of low-latency system design, GPU optimization, and secure streaming
Did you find something suspicious?