Posted on: 31/07/2025
Job Description:
We are hiring an LLM Ops Support Engineer to support the operation and maintenance of Large Language Model (LLM)-based enterprise solutions. The role includes model management, incident handling, performance monitoring, and implementation of guardrails to ensure reliable and ethical AI outcomes. The ideal candidate has solid experience with Azure-based GenAI solutions and is comfortable working in production-grade support environments.
Key Responsibilities:
- Provide operational support for LLMs, including incident management, ongoing model maintenance, and user support.
- Monitor and maintain deployed GenAI solutions, including performance tracking, user interaction analysis, and efficacy reporting.
- Review and monitor AI guardrails, such as bias audits and other governance checks defined for the deployed solutions.
- Support codebase management with a focus on proactive code optimization beyond break-fix efforts.
Required Skills:
- Strong experience in developing, deploying, and managing LLMs on Microsoft Azure.
- Proven expertise in Azure services including Azure Machine Learning, Azure Kubernetes Service, and Azure DevOps.
- In-depth knowledge of LLMOps guardrail implementation practices.
- Excellent problem-solving skills and ability to collaborate effectively in a team-based environment.
Posted in
DevOps / SRE
Functional Area
ML / DL Engineering
Job Code
1522132