Tekonika - Senior Multimodal AI Engineer - Vision & LLM Systems

Tekonika Technologies
5 - 10 Years
Multiple Locations

Posted on: 13/04/2026

Job Description

Role Overview :

As a Senior Multimodal AI Engineer - Vision & LLM Systems, you will develop and deploy AI solutions that integrate computer vision and large language models. You will collaborate closely with cross-functional teams, including research scientists, product managers, and software engineers, to design, build, and deploy state-of-the-art AI systems.

Your work will directly shape next-generation AI applications, improving user experiences and driving business value across multiple industries.

Key Responsibilities :

- Design and develop novel multimodal AI systems that combine computer vision and large language models to solve complex real-world problems.

- Research and implement cutting-edge techniques in areas such as image recognition, object detection, natural language processing, and multimodal fusion.

- Collaborate with product managers to define product requirements and translate them into technical specifications.

- Build and maintain scalable and robust AI pipelines for training, evaluating, and deploying multimodal AI models.

- Evaluate and benchmark different AI models and algorithms to identify the best solutions for specific use cases.

- Contribute to the development of internal tools and libraries to streamline the AI development process.

- Mentor junior engineers and provide technical guidance on AI-related topics.

- Stay up to date with the latest advancements in AI research and identify opportunities to incorporate them into our products.

Required Skillset :

- Demonstrated ability to design, develop, and deploy multimodal AI systems using computer vision and large language models.

- Proven expertise in deep learning frameworks such as TensorFlow or PyTorch.

- Strong understanding of computer vision techniques, including image recognition, object detection, and image segmentation.

- Solid knowledge of natural language processing techniques, including text classification, named entity recognition, and machine translation.

- Excellent programming skills in Python and experience with relevant libraries such as OpenCV, NumPy, and scikit-learn.

- Ability to communicate complex technical concepts clearly and effectively to both technical and non-technical audiences.

- Master's or Ph.D. degree in Computer Science, Artificial Intelligence, or a related field.

- Ability to work independently and collaboratively in a fast-paced, dynamic environment, whether remotely or in a hybrid setup.

The job is for:

May work from home