Posted on: 06/08/2025
Key Responsibilities :
- Implement infrastructure as code and environment provisioning, including AI-powered script and configuration generation.
- Monitor and optimize system reliability, availability, and security using AI/ML for predictive alerts and intelligent scaling.
- Establish DevSecOps best practices, compliance controls, and MLOps (CI/CD for ML workflows).
- Troubleshoot complex infrastructure, deployment, and ML model serving issues.
Core Technical & AI/ML Skills :
- Build and maintain CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions, ArgoCD).
- Cloud infrastructure (AWS, Azure, GCP), containerization and orchestration (Docker, Kubernetes).
- Logging, monitoring, and observability (Prometheus, Grafana, ELK/EFK), including AI-driven log analysis and incident prediction.
- Experience supporting MLOps: deploying ML workflows and ensuring model traceability and compliance.
- Use of AI assistants and workflow tools to write scripts, manage incidents, and enforce security policies (OPA, Sentinel).
Soft Skills :
- Strong communicator across development, security, and business teams.
- Mentorship and process rigor, with openness to AI-driven productivity improvements.
- High accountability, initiative, and a proactive approach to emerging technologies.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1525452