Try NVIDIA NIM APIs

nvidia llama-3.1-nemotron-nano-4b-v1.1

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

edge tool calling reasoning math nvidia

qwen qwen3-235b-a22b

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

complex math advanced reasoning instruction following qwen

nvidia nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

nemo retriever embedding retrieval augmented generation nvidia

mistralai mistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

code chat reasoning agent-centric multilingual mistralai

qwen qwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completion code generation chat text-to-code qwen

qwen qwen2.5-coder-7b-instruct

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

code completion code generation chat text-to-code qwen

nvidia usdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

openusd synthetic data generation digital twin code generation chat nvidia nim nvidia

abacusai dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

chat code generation text-to-text abacusai

mistralai mamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completion code generation chat code generation mistralai

nv-mistralai mistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

code generation chat language generation text-to-text run-on-rtx nv-mistralai

bigcode starcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

bigcode starcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

google gemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation chat text-to-text language generation google

google gemma-2-9b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation text-to-text language generation google

google codegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

chat code generation code completion google

google codegemma-7b

Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

chat code generation chat language generation text-to-code google

google gemma-7b

Cutting-edge text generation model text understanding, transformation, and code generation.

chat code generation chat text-to-text language generation google