Open
AI Platform Engineer – NVIDIA-Driven On-Prem AI Platform
Job Description
As an AI Platform Engineer, you will develop, operate, and optimize an on-prem AI platform built on NVIDIA AI Enterprise. You ensure high performance, security, and reliability so that data scientists and engineering teams can efficiently leverage GPU resources. The role includes managing Kubernetes across GPU nodes, orchestrating workloads via Run:AI, and maintaining NVIDIA drivers, CUDA, and the full AI Enterprise stack. You will help shape a scalable platform that accelerates experimentation, training, and deployment of advanced AI models.
TopSkills
- NVIDIA AI Enterprise (NeMo, NIM, Triton)
- Kubernetes on GPU nodes
- Run:AI / GPU orchestration
- NVIDIA drivers & CUDA
- Infrastructure engineering
- Platform reliability & security