Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

9 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Code Generation

Inference Providers

Deep Infra

Together AI

Fireworks AI

GMI Cloud

CoreWeave

Publisher

NVIDIA

DeepSeek AI

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

Model

chat

795K

9mo

NVIDIA

Downloadable

llama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

Model

chat

8.11M

8mo

NVIDIA

Downloadable

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

chat

613K

8mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

1.15M

7mo

NVIDIA

Downloadable

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Model

chat

5.02M

7mo

Moonshotai

Free Endpoint

kimi-k2-thinking

Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.

Model

Conversational

2.88M

3mo

DeepSeek AI

Free Endpoint

deepseek-v3.1-terminus

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

Model

chat

12.33M

5mo

llama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

Model

Instruction following

19.3M

9mo

NVIDIA

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

Model

chat

573K

Items per page

of 1 pages