Posted on: 13/07/2025
Position Overview
As an AI Solution Architect with 8 +years of experience, you will lead the design, development, and implementation of cutting-edge AI solutions that solve complex business problems. You will architect and deploy transformer-based models and generative AI systems that deliver measurable business value while ensuring security, scalability, and optimal performance. Strong project management capabilities with experience leading cross-functional technical teams and exceptional communication skills with ability to present complex solutions to senior leadership & stakeholders.
Key Responsibilities :
- Design and architect enterprise-grade AI solutions with emphasis on transformer architectures and generative AI systems
- Develop and implement strategies for training, fine-tuning, and deploying open-source LLMs (Large Language Models)
- Implement cost-efficient and low-latency architectures for LLM inference services
- Build secure API frameworks for generative AI data transmission, processing, and reception
- Design optimized pipelines for processing multimodal data including text, images, and video for vector embeddings
- Lead technical discovery sessions with stakeholders to translate business requirements into AI solution designs
- Create detailed technical specifications, reference architectures, and implement roadmaps
- Engineer scalable solutions capable of handling increased request volumes and data storage needs
- Develop MVPs from proof-of-concepts, accelerating the development of the lifecycle of AI products
- Provide technical leadership for AI development teams using agile methodologies
Required Qualifications :
- 8+ years of experience in software development with at least 5 years focused on AI/ML solutions
- Extensive experience with transformer-based models (Anthropic, GPT, T5, LLaMA, Mistral) and generative AI technologies
- Proven expertise in fine-tuning and deploying open-source LLMs for production environments
- Deep knowledge of vector databases (Pinecone, Weaviate, Milvus, FAISS) and retrieval-augmented generation
- Strong proficiency in the Azure AI ecosystem, including Azure OpenAI Service, Azure Machine Learning, and Azure Cognitive Services
- Experience with LLM optimization techniques including quantization, distillation, and prompt engineering
- Expertise in designing and implementing secure API frameworks with JWT, OAuth, and API gateways
- Demonstrated ability to create low-latency, high-throughput AI systems using efficient orchestration
- Hands-on experience with containerization (Docker), orchestration (Kubernetes), and microservices architectures
- Proficiency in Python and AI frameworks such as PyTorch, TensorFlow, Hugging Face Transformers, and LangChain
- Experience with MLOps practices and CI/CD pipelines for model deployment and monitoring
- Strategic thinking to align AI solutions with broader business objectives and customer needs
- Collaborative approach to problem-solving with adaptability to rapidly evolving technologies
Preferred Qualifications :
- Experience with multi-modal AI systems integrating vision and language capabilities
- Knowledge of embedding models (CLIP, SBERT, Ada) and their applications
- Expertise in RAG (Retrieval-Augmented Generation) architecture and implementations
- Experience with Azure Kubernetes Service (AKS) for model deployment
- Familiarity with vector search optimization and semantic caching strategies
- Background in implementing AI guardrails and safety measures for generative AI systems
- Experience with streaming inference and real-time AI processing
- Knowledge of distributed training techniques and infrastructure
- Expertise in GPU/TPU utilization optimization for AI workloads
- Experience with enterprise data governance and compliance requirements for AI systems
- Creative perspective for presenting AI strategies and roadmaps to stakeholders with illustrative flow diagrams & engaging content
- Customer-focused mindset with emphasis on delivering tangible business outcomes
- Intellectual curiosity and passion for staying current with emerging AI technologies and implementing PoC level solutions to accelerate and inculcate these solutions in development team
Did you find something suspicious?