Posted on: 19/12/2025
Description :
Experience : 10 to 15 years
Location : Pune
Overview :
Key Responsibilities :
- Design system-level LLMOps and MLOps frameworks (model lifecycle, retraining, observability, feedback loops).
- Establish CI/CD, monitoring, and scalability strategies for AI microservices.
- Define model optimization pipelines (quantization, distillation, pruning, caching).
- Integrate MLflow and Vertex AI for experiment tracking and production deployment.
- Drive cloud infrastructure strategy compute scaling, GPU optimization, container orchestration.
- Ensure AI safety, interpretability, and compliance across models and agents.
Qualifications : BTech, BE, MCA
Essential skills :
Core Skills :
- MLOps / LLMOps : MLflow, Vertex AI Pipelines, Weights & Biases, model monitoring.
- DevOps Infrastructure : Kubernetes, Docker, Terraform, Jenkins, GCP/AWS CI/CD.
- Optimization Techniques : Finetuning (LoRA, QLoRA), model quantization, distillation, caching.
- System Design : Scalable APIs, message queues, observability, and fault tolerance.
- Programming : Python
- Cloud Platforms : Google Cloud (Vertex AI), AWS, or Azure.
Desired skills :
- Strong understanding of AI safety, governance, and trust frameworks.
- Experience implementing MCP, multi-agent orchestration, or custom reasoning layers.
- Poven success in leading enterprise-scale AI transformation initiatives.
Did you find something suspicious?