Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

6 results for

Filters (3)

Free Endpoint

Partner Endpoint

Download Available

Launchable

Enterprise

Use Case

Code Generation

Drug Discovery

Retrieval Augmented Generation

Image-to-Text

Object Detection

Inference Providers

Together AI

Deep Infra

Fireworks AI

GMI Cloud

Bitdeer AI

Publisher

NVIDIA

Mistral AI

Qwen

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

chat

371K

9mo

NVIDIA

Downloadable

llama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

Model

chat

5.61M

8mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

8mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

2.94M

8mo

Mistral AI

Free Endpoint

magistral-small-2506

High performance reasoning model optimized for efficiency and edge deployment

Model

coding

1.84M

8mo

Qwen

Free Endpoint

qwq-32b

Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.

Model

coding

1.98M

9mo

Items per page

of 1 pages