Posted on: 24/05/2025
About Us :
Role Overview :
Key Responsibilities :
- Design and implement RAG pipelines combining LLMs with structured or unstructured data.
- Fine-tune and prompt engineer LLMs to align with product requirements.
- Optimize inference pipelines for latency and throughput.
- Collaborate with DevOps and platform teams to build robust and scalable infrastructure.
- Stay up to date with the latest trends and advancements in generative AI.
Requirements :
- 1+ years of experience working with LLMs, transformers, or NLP-based systems.
- Proficiency in Python and Golang, and experience with frameworks like LangChain,
LlamaIndex, or Hugging Face Transformers.
- Solid understanding of vector databases (e.g., FAISS, Pinecone, Weaviate).
- Experience integrating APIs and managing cloud-based deployments (AWS, GCP, or Azure).
- Strong problem-solving skills and the ability to work in an agile environment.
Nice to Haves :
- Experience with real-time or low-latency systems.
- Understanding of multimodal models (text + image/audio).
- Prior experience with prompt optimization or LLMOps tooling.
Did you find something suspicious?
Posted By
Posted in
AI/ML
Functional Area
ML / DL Engineering
Job Code
1484807
Interview Questions for you
View All