HamburgerMenu
hirist

Artificial Intelligence Architect - RAG/LLM

Dexian
Pune
12 - 15 Years

Posted on: 10/12/2025

Job Description

About the Role :


We are looking for an experienced AI/ML Architect to lead the design and implementation of advanced Generative AI and RAG (Retrieval-Augmented Generation) solutions.

The role combines hands-on architecture design, pre-sales engagement, and technical leadership across enterprise AI initiatives.

You will drive solutioning around LLMs, knowledge retrieval, and MCP-based multi-agent architectures, helping customers unlock business value from AI responsibly and at scale.

Key Responsibilities :

- Architect and deliver enterprise-grade AI/ML & Generative AI solutions, including RAG pipelines, LLM integrations, and intelligent agents.

- Engage in pre-sales activities: collaborate with business development, present technical solutions, estimate effort, and support proposals/PoCs for prospects.

- Design knowledge retrieval layers using vector databases (FAISS, Pinecone, Milvus, Chroma, Weaviate).

- Develop document ingestion, embedding, and context-retrieval pipelines for unstructured and structured data.

- Architect and manage MCP (Model Context Protocol) servers for secure context exchange, multi-model orchestration, and agent-to-agent collaboration.

- Define LLMOps / MLOps best practices CI/CD for models, prompt versioning, monitoring, and automated evaluation.

- Collaborate with pre-sales and business teams to shape AI solution proposals, PoCs, and client demos.

- Lead AI innovation initiatives and mentor technical teams on GenAI, RAG, and MCP frameworks.

- Ensure data privacy, compliance, and responsible AI across all deployments.

- Work closely with ITS, TIC team to provide mentorship and guidance to AI developers.

Required Skills & Experience :

- 1215 years of overall experience with 57 years in AI/ML and 3+ years in Generative AI / LLM architecture.

- Strong hands-on experience with RAG pipelines, vector search, and semantic retrieval.

- Proven experience integrating LLMs (OpenAI, Claude, Gemini, Mistral, etc.) using frameworks such as LangChain, LlamaIndex, or PromptFlow.

- Deep understanding of MCP servers configuration, context routing, memory management, and protocol-based interoperability.

- Strong programming skills in Python, and familiarity with containerization (Docker, Kubernetes) and cloud AI services (Azure OpenAI, AWS Bedrock, GCP Vertex AI).

- Expertise in MLOps/LLMOps tools (MLflow, KubeFlow, LangSmith, Weights & Biases).

- Solid grounding in data engineering, pipelines, and orchestration tools (Airflow, Prefect).

- Excellent communication, client engagement, and technical presentation skills.

- Proven track record of practice building or leadership in emerging technology domains.

info-icon

Did you find something suspicious?