Description :
Role : Senior Consultant Platform Engineer
Role Overview :
- As a Consultant within the Dell AI & Data CoE, you will lead the architecture and implementation of large-scale AI platforms for our most strategic global customers.
- You are not just a builder; you are a technical visionary who helps clients navigate the complexities of the Dell AI Factory with NVIDIA and OSS.
- You will bridge the gap between Dell's world-class hardware (PowerEdge, PowerScale, PowerSwitch) and the advanced software orchestration layers (NVAIE, Kubernetes, Slurm) required to turn raw silicon into business value.
Key Responsibilities :
- Lead technical workshops to design Sovereign AI and Private Cloud AI platforms using Dell Validated Designs (DVD).
- Act as a Subject Matter Expert (SME) on the integration of NVIDIA AI Enterprise (NVAIE) with Dell PowerEdge XE servers (H100/H200/B200).
- Develop high-level and low-level designs (HLD/LLD) that incorporate GPU/Network Operators and high-speed InfiniBand/RoCE fabrics.
- Deploy and optimize Red Hat OpenShift and upstream Kubernetes in air-gapped or hybrid-cloud enterprise environments.
- Implement advanced workload scheduling and fractional GPU slicing using Run:ai or Slurm to maximize client ROI on hardware.
- Guide customers in choosing and implementing the right orchestration layer (e.g., BCM for bare metal vs. Kubernetes for microservices).
- Architect end-to-end MLOps pipelines utilizing Kubeflow, MLflow, or ClearML to streamline the "data-to-model" lifecycle.
- Enable distributed training and fine-tuning (LLMs/GenAI) for clients using Ray and PyTorch on Dell infrastructure.
- Integrate Rafay for clients requiring decentralized or multi-cluster AI management across edge and core data centres.
- Contribute to the CoE by developing reusable IP, deployment playbooks, and automated Ansible/Helm/Terraform scripts.
- Mentor junior consultants and lead technical proof-of-concepts (PoCs) that demonstrate the performance of Dell-NVIDIA stacks.
Essential Requirements :
- 10+ years in professional services or consulting, with a heavy focus on AI, Big Data, or HPC infrastructure.
- GPU Orchestration : Mastery of NVIDIA GPU Operator, Network Operator, and NVIDIA Base Command Manager (BCM).
- Cloud-Native : Expert-level Kubernetes (CKA/CKS) or Red Hat OpenShift skills, including complex security, CNI (Cilium/Multus) and storage (CSI) configurations.
- Workload Management : Experience with Run:ai, Slurm, or Altair PBS for high-concurrency AI environments.
- ML Platforms : Hands-on experience with Kubeflow, MLflow, Ray, and ClearML.
- Automation : Advanced Ansible, Helm, Terraform, and Python skills for "Infrastructure as Code" delivery.
- Infrastructure : Deep expertise in Dell PowerEdge (XE/R series), PowerScale, and PowerSwitch networking.
Desirable Requirements :
- Communication : Exceptional client-facing communication skills, e.g., the ability to explain complex GPU-to-GPU communication (NVLink/NVSwitch) to C-level stakeholders.
- Travel : Willingness to travel to client sites as needed to lead deployments.
- Preferred Certifications : CKA or Red Hat Certified Specialist, NVIDIA Certified Associate/Professional, Dell PowerEdge/PowerScale Proven Professional.
Who we are :
- We believe that each of us has the power to make an impact.
- That's why we put our team members at the center of everything we do.
- If you're looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we're looking for you.
- Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play.
- Join us to build a future that works for everyone because Progress Takes All of Us.
- Application closing date : 15 May 2026.
- Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.
- Read the full Equal Employment Opportunity Policy here.