Role Overview :

The AI/ML Engineer will be responsible for designing, training, fine-tuning, and optimizing AI/ML models, with a strong focus on Agentic AI systems capable of autonomous reasoning, planning, and action execution. The role involves working closely with product, platform, and engineering teams to deploy scalable, efficient, and secure AI models in production environments.

Key Roles & Responsibilities

Agentic AI Model Development

- Design and implement agentic AI architectures (multi-agent systems, tool-using agents, planners, and orchestrators)

- Develop autonomous agents capable of decision-making, reasoning, and task execution

- Integrate LLM-based agents with external tools, APIs, workflows, and SDKs

Model Training & Fine-Tuning :

- Fine-tune LLMs and foundation models using techniques such as:

- Supervised Fine-Tuning (SFT)

- Reinforcement Learning from Human Feedback (RLHF)

- Preference Optimization (DPO, PPO)

- Train domain-specific models using structured and unstructured datasets

- Prepare, clean, label, and validate training datasets

Model Optimization :

- Optimize models for latency, memory, and cost efficiency

- Apply techniques such as:

- Quantization

- Pruning

- Knowledge distillation

- Low-Rank Adaptation (LoRA / QLoRA)

- Enable efficient inference on cloud, edge, or hybrid environments

Evaluation & Validation :

- Define evaluation metrics for:

- Accuracy

- Reasoning quality

- Hallucination reduction

- Task success rate

- Perform benchmarking and A/B testing

- Conduct bias, robustness, and adversarial testing

MLOps & Deployment :

- Implement monitoring for model drift, performance degradation, and failures

- Manage model versioning, rollback, and retraining strategies

- Collaborate with DevOps teams for scalable deployments

Research & Innovation :

- Stay updated with advancements in:

- LLMs

- Agentic AI frameworks

- Multimodal models

- Experiment with new architectures, tools, and optimization methods

- Translate research findings into production-ready solutions

07: - Collaboration & Stakeholder Engagement

- Work closely with:

- Backend & SDK engineers

- Product managers

- Security and compliance teams

- Provide technical guidance and mentorship when required

Required Skills & Qualifications

Technical Skills :

- Strong proficiency in Python

- Experience with PyTorch / TensorFlow

- Hands-on experience with LLMs and agentic frameworks (LangChain, LlamaIndex, AutoGen, CrewAI, etc.)

- Experience in fine-tuning and optimization of large models

- Knowledge of vector databases and retrieval systems

- Familiarity with cloud platforms (AWS / GCP / Azure)

Preferred Skills :

- Experience with multimodal models (text, voice, vision)

- Knowledge of distributed training (DeepSpeed, FSDP)

- Experience deploying AI models in production

- Understanding of AI safety and explainability