Role Overview :
The AI/ML Engineer will be responsible for designing, training, fine-tuning, and optimizing AI/ML models, with a strong focus on Agentic AI systems capable of autonomous reasoning, planning, and action execution. The role involves working closely with product, platform, and engineering teams to deploy scalable, efficient, and secure AI models in production environments.
Key Roles & Responsibilities
Agentic AI Model Development
- Design and implement agentic AI architectures (multi-agent systems, tool-using agents, planners, and orchestrators)
- Develop autonomous agents capable of decision-making, reasoning, and task execution
- Integrate LLM-based agents with external tools, APIs, workflows, and SDKs
Model Training & Fine-Tuning :
- Fine-tune LLMs and foundation models using techniques such as:
- Supervised Fine-Tuning (SFT)
- Reinforcement Learning from Human Feedback (RLHF)
- Preference Optimization (DPO, PPO)
- Train domain-specific models using structured and unstructured datasets
- Prepare, clean, label, and validate training datasets
Model Optimization :
- Optimize models for latency, memory, and cost efficiency
- Apply techniques such as:
- Quantization
- Pruning
- Knowledge distillation
- Low-Rank Adaptation (LoRA / QLoRA)
- Enable efficient inference on cloud, edge, or hybrid environments
Evaluation & Validation :
- Define evaluation metrics for:
- Accuracy
- Reasoning quality
- Hallucination reduction
- Task success rate
- Perform benchmarking and A/B testing
- Conduct bias, robustness, and adversarial testing
MLOps & Deployment :
- Implement monitoring for model drift, performance degradation, and failures
- Manage model versioning, rollback, and retraining strategies
- Collaborate with DevOps teams for scalable deployments
Research & Innovation :
- Stay updated with advancements in:
- LLMs
- Agentic AI frameworks
- Multimodal models
- Experiment with new architectures, tools, and optimization methods
- Translate research findings into production-ready solutions
07: - Collaboration & Stakeholder Engagement
- Work closely with:
- Backend & SDK engineers
- Product managers
- Security and compliance teams
- Provide technical guidance and mentorship when required
Required Skills & Qualifications
Technical Skills :
- Strong proficiency in Python
- Experience with PyTorch / TensorFlow
- Hands-on experience with LLMs and agentic frameworks (LangChain, LlamaIndex, AutoGen, CrewAI, etc.)
- Experience in fine-tuning and optimization of large models
- Knowledge of vector databases and retrieval systems
- Familiarity with cloud platforms (AWS / GCP / Azure)
Preferred Skills :
- Experience with multimodal models (text, voice, vision)
- Knowledge of distributed training (DeepSpeed, FSDP)
- Experience deploying AI models in production
- Understanding of AI safety and explainability