Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

8 results for

Filters (2)

Free Endpoint

Partner Endpoint

Download Available

Launchable

Enterprise

Use Case

Code Generation

Drug Discovery

Retrieval Augmented Generation

Image-to-Text

Object Detection

Inference Providers

Deep Infra

Together AI

CoreWeave

Digital Ocean

Fireworks AI

Publisher

NVIDIA

granite-3.3-8b-instruct

Small language model fine-tuned for improved reasoning, coding, and instruction-following

Model

coding

78.41K

8mo

NVIDIA

Downloadable

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

chat

358K

9mo

NVIDIA

Downloadable

llama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

Model

chat

5.84M

8mo

llama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

Model

Instruction following

16.65M

9mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

886K

8mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

3.21M

8mo

NVIDIA

Downloadable

nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

Model

chat

12.54M

3mo

NVIDIA

Downloadable

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

Model

chat

31.56M

Items per page

of 1 pages