
Excels in agentic coding and browser use and supports 256K context, delivering top results.

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Model for writing and interacting with code across a wide range of programming languages and tasks.

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

Advanced programming model for code completion, summarization, and generation

Cutting-edge text generation model text understanding, transformation, and code generation.

Cutting-edge text generation model text understanding, transformation, and code generation.