Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

11 results for

Filters (1)

Free Endpoint

7

Partner Endpoint

11

Download Available

4

Launchable

0

Enterprise

0

Use Case

Image-to-Text

1

Code Generation

0

Drug Discovery

0

Retrieval Augmented Generation

0

Object Detection

0

Inference Providers

Fireworks AI

9

Deep Infra

8

GMI Cloud

5

Bitdeer AI

5

Together AI

4

Publisher

Qwen

3

Mistral AI

2

Moonshotai

2

Google

1

DeepSeek AI

1

Blueprint Type

NVIDIA AI

0

NVIDIA Omniverse

0

NVIDIA BioNemo

0

NVIDIA Isaac GR00T

0

Labels (1)

Agentic

Sort By

Free Endpoint

deepseek-v3.1-terminus

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

13.2M

5mo

Free Endpoint

devstral-2-123b-instruct-2512

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

4.65M

3mo

Downloadable

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

1d

Downloadable

glm-5

GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.

35.45M

1mo

Free Endpoint

kimi-k2-instruct

State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities

20.61M

8mo

Free Endpoint

kimi-k2-instruct-0905

Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.

14M

6mo

Free Endpoint

mistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

6.27M

4mo

Free Endpoint

qwen3-coder-480b-a35b-instruct

Excels in agentic coding and browser use and supports 256K context, delivering top results.

3.68M

7mo

Downloadable

qwen3-next-80b-a3b-instruct

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

20.32M

6mo

Downloadable

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

13.56M

1mo

Free Endpoint

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

8.99M

2mo

Items per page

of 1 pages