Posted on: 07/11/2025
Description :
Roles & Responsibilities :
- We are looking for a passionate machine learning expert to lead design and development of next-gen video intelligence efforts that drive key applications powered by multimodal video understanding.
- You have significant experience in building reliable machine learning products in the computer vision domain.
- Furthermore, you are passionate about multi modal learning and how AI can transform the media & entertainment industry and make all the amazing content accessible to everyone around the world.
- Architect, build and scale computer vision and multimodal capabilities to support video understanding use-cases within WBD
- Work closely with Principal Engineers on defining technical vision and strategy for the platform, as well as on prioritizing capabilities alongside with Product team
- Develop or fine-tune models for fine grained scene segmentation, annotation, summarization, metadata generation, and selection from videos.
- Use open-source and off-the-shelf gen AI models as the starting point for model fine-tuning and improvements, maximizing precision and recall on WBD content and objectives
- Help build a culture of innovation and engineering excellence by providing guidance to a team of machine learning engineers
- Collaborate with adjacent engineering teams to ensure tight integrations to production pipelines in media supply chain and video streaming applications
- Evaluate and work with vendors to ensure product deliverables and engineering standards are met, when applicable
- Be a thought leader and engineering and operational excellence champion.
- Demonstrate and foster a culture of growth mindset and curiosity for keeping up with the evolving state of the art and be nimble to rapidly adapt to new models and architectures
- Mentor, influence engineers across organizations and lead by example with high quality work at the organization level
What to Bring :
- 9- 12 years of coding experience in the industry (C/C++, Java, Go, Python)
- 5+ years of applied machine learning experience to CV tasks
- Masters/PhD degree required
- Excellent familiarity with state-of-the-art large-language-models, image and video models and their applications to multi modal tasks
- Experience in video representation learning (e.,TimeSformer/SlowFast/X3D), shot/scene segmentation, keyframe extraction, visual retrieval (CLIP/BLIP/SigLIP or custom dual encoders).
- Retrieval and re-ranking (e., HNSW/FAISS)
- Scaling model training/inference on multi-GPUs
- Ability to implement algorithms from state-of-the-art papers in the domain of computer vision (in particular, video processing)
- Excellent architecture and design skills applied to Machine Learning products
- Experience with PyTorch and/or TensorFlow
- Very strong foundations in Computer Science, Data Structures and Algorithms
- Ability to transfer high level, abstract and ambiguous tasks into a set of clear technical requirements
- Ability to understand and embrace complex business interest and operations, and communicate the impact of AI in less technical terms with stakeholders
- Systematic and pragmatic engineer, balancing business sense of urgency with engineering excellence
- Ability to assess different solutions for the same problem with key metrics, trade-offs and be able to make an informed decision
What We Offer :
- A Great Place to work.
- Equal opportunity employer
- Fast track growth opportunities
Did you find something suspicious?