Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

Powerful mid-size code model with a 32K context length that excels at coding in multiple languages.

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Model for writing and interacting with code across a wide range of programming languages and tasks.

Advanced programming model for code completion, summarization, and generation.

Cutting-edge text generation model for text understanding, transformation, and code generation.