Open
Applied AI Engineer – On-Prem LLM & Automation Specialist
Job Description
We are seeking a hands-on Applied AI Engineer to build and optimize advanced AI-driven automation flows on our on-prem NVIDIA-powered AI platform. You will work closely with end users such as analysts, legal teams, economists, and IT staff to translate real operational needs into robust LLM applications. The role includes developing secure MCP-based solutions, packaging services for Kubernetes GPU environments, and designing high-quality RAG pipelines and multi-agent orchestration patterns that scale reliably in an enterprise setting.
Top Skills
- Python (FastAPI, Pydantic AI, asyncio)
- LLM-application development
- Kubernetes on NVIDIA GPUs
- MCP: authentication, authorization & API management
- RAG stack: Milvus/pgvector/Redis, embeddings, tuning
- Agent orchestration & automation flows
- GitOps, Helm/Kustomize, ArgoCD
- Secure internal app development
- NVIDIA AI Enterprise, NeMo, NIM