Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding, with multimodal input and 256K context.

code generation

19.74M

2mo

Qwen

Free Endpoint

qwen3-coder-480b-a35b-instruct

Excels at agentic coding and browser use, and supports 256K context.

agentic coding

5.23M

9mo

NVIDIA

Free Endpoint

nv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

nemo retriever

330K

Qwen

Deprecated, Downloadable

qwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completion

943K

11mo

NVIDIA

Free Endpoint

usdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

Digital Twin

11mo

Abacus.AI

Free Endpoint

dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Code Generation

604K