Open

Applied AI Engineer – On-Prem LLM & Automation Specialist

Posted 2 weeks ago by Daniel Fransson
Stockholm
Apply Now

Apply for this job

Job Description

We are seeking a hands-on Applied AI Engineer to build and optimize advanced AI-driven automation flows on our on-prem NVIDIA-powered AI platform. You work closely with end-users such as analysts, legal teams, economists and IT to translate real operational needs into robust LLM applications. The role includes developing secure MCP-based solutions, packaging services for Kubernetes GPU environments, and designing high-quality RAG pipelines and multi-agent orchestration patterns that scale reliably in an enterprise setting.

TopSkills

  • Python (FastAPI, Pydantic AI, Asyncio)
  • LLM-application development
  • Kubernetes on NVIDIA GPU
  • MCP: Auth, authorization & API management
  • RAG stack: Milvus/pgvector/Redis, embeddings, tuning
  • Agent orchestration & automation flows
  • GitOps, Helm/Kustomize, ArgoCD
  • Secure internal app development
  • NVIDIA AI Enterprise, NeMo, NIM