
Senior OpenShift Platform Engineer - MLOps/LLMOps

Outsourced
Multiple Locations
5 - 8 Years
Rating: 3.8 (4+ Reviews)

Posted on: 25/07/2025

Job Description

Location : Remote.

Type : Full-Time.

Experience Level : Senior (minimum 5 years of relevant experience).

Industry : Cloud Infrastructure, AI/ML Ops, Kubernetes.


About the Role :


We're looking for a Senior OpenShift Platform Engineer with 5+ years of hands-on experience in OpenShift and Kubernetes.


In this strategic role, you'll lead the design and implementation of MLOps / LLMOps systems on OpenShift, mentor engineers, and help scale secure, high-performance AI infrastructure in collaboration with cross-functional teams.


Key Responsibilities :


Platform Leadership :


- Architect, install, upgrade, and manage OpenShift clusters on both bare metal and VMware.


- Lead the deployment of MLOps / LLMOps workflows in OpenShift AI environments.

- Implement production-grade solutions for model deployment, monitoring, and validation pipelines.


Infrastructure Excellence :


- Set up robust monitoring (Prometheus, Thanos, Grafana), logging, and backup strategies.


- Drive improvements in scalability, performance, and reliability of containerized platforms.

- Ensure secure and efficient configurations for RBAC, networking, and persistent storage (NetApp preferred).


Collaboration & Communication :


- Translate customer and product requirements into technical solutions.


- Lead architectural and code reviews across distributed engineering teams.

- Mentor team members and promote a high standard of engineering practices.


Incident Management & Governance :


- Own and lead root cause analysis (RCA) and post-mortem follow-ups.


- Define and enforce platform standards, technical governance, and compliance practices.


Must-Have Qualifications :


- Minimum 5 years of experience in OpenShift cluster installation, management, and lifecycle operations.


- Proven experience designing highly available and scalable systems in enterprise environments.

- Hands-on experience with bare metal or VMware-based OpenShift deployments.

- Deep understanding of Kubernetes/OpenShift security, RBAC, networking, and persistent storage (NetApp preferred).

- Expertise in setting up monitoring, logging, and backup solutions.

- Proficiency with CI/CD and DevOps tools such as GitLab and ArgoCD.

- Solid experience with observability tools like Prometheus, Thanos, and Grafana.

- Strong communication and presentation skills to engage both technical and non-technical stakeholders.

- Ability to juggle multiple projects and deliver with minimal oversight.


Bonus Skills (Highly Desirable) :


- Experience with OpenShift AI and deployment of MLOps/LLMOps workflows.


- Familiarity with OpenShift Virtualization or KubeVirt.

- Open-source contributions in Kubernetes/MLOps communities.

- Experience leading customer-facing technical discussions and workshops.

