Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

17 results for

Filters (1)

API Endpoint

Download Available

Launchable

Use Case

Image-to-Text

Publisher

Mistral AI

Qwen

NVIDIA

DeepSeek AI

Moonshotai

Blueprint Type

NVIDIA AI

Labels (1)

chat

Sort By

Z.ai

glm5

GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.

Model

MoE

7.94M

Stepfun-ai

step-3.5-flash

200B open-source reasoning engine with sparse MoE powering frontier agentic AI.

Model

Agentic

7.29M

1mo

Minimaxai

minimax-m2.1

MiniMax M2.1 excels in multi-language coding, app/web dev, office AI, and agent integration

Model

Agentic

8.38M

1mo

Qwen

qwen3-next-80b-a3b-instruct

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

Model

chat

11.15M

5mo

Moonshotai

kimi-k2-instruct

State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities

Model

coding

20.23M

7mo

Qwen

qwen3-coder-480b-a35b-instruct

Excels in agentic coding and browser use and supports 256K context, delivering top results.

Model

agentic coding

3.83M

6mo

Qwen

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

Model

MoE

6.55M

Mistral AI

mistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

Model

language generation

6.17M

3mo

DeepSeek AI

deepseek-v3.1-terminus

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

Model

tool calling

13.01M

5mo

Mistral AI

devstral-2-123b-instruct-2512

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

Model

coding

5.78M

3mo

Moonshotai

kimi-k2-instruct-0905

Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.

Model

long-context

10.04M

5mo

DeepSeek AI

deepseek-v3.2

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

Model

long context

15.64M

2mo

Z.ai

glm4.7

GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.

Model

Tool Calling

17.75M

1mo

NVIDIA

llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

Model

chat

606K

8mo

Mistral AI

mistral-nemotron

Built for agentic workflows, this model excels in coding, instruction following, and function calling

Model

language generation

721K

9mo

NVIDIA

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

Model

thinking budget

753K

6mo

ByteDance

seed-oss-36b-instruct

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Model

thinking budget

3.46M

6mo

Items per page

of 1 pages