Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

3.17M

10mo

Downloadable, Free Endpoint

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

4.93M

10mo

Downloadable, Free Endpoint

Model with leading reasoning and agentic AI accuracy for PC and edge.

1.47M

11mo

Advanced LLM for reasoning, math, general knowledge, and function calling.

18.79M