Try NVIDIA NIM APIs

Copyright © 2026 NVIDIA Corporation

52 results for

Labels (1)

Text-to-text

Free Endpoint

granite-guardian-3.0-8b

Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

494K

1y

Downloadable

qwen3-next-80b-a3b-thinking

80B-parameter AI model with hybrid reasoning, MoE architecture, and support for 119 languages.

4.24M

6mo

Free Endpoint

shieldgemma-9b

Guardrail model to ensure that responses from LLMs are appropriate and safe

603K

1y

Downloadable

minimax-m2.5

MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

4.19M

2w

Free Endpoint

breeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

579K

9mo

Free Endpoint

deepseek-v3.1

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

13.99M

6mo

Free Endpoint

deepseek-v3.2

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

16.35M

2mo

Free Endpoint

dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

643K

9mo

Free Endpoint

gemma-2-27b-it

Cutting-edge text generation model for text understanding, transformation, and code generation.

749K

9mo

Free Endpoint

gemma-2-2b-it

Advanced small language generative AI model for edge applications

564K

9mo

Downloadable

gemma-2-9b-it

Cutting-edge text generation model for text understanding, transformation, and code generation.

4.59M

9mo

Downloadable

gemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications.

4.34K

554K

9mo

Free Endpoint

gemma-7b

Cutting-edge text generation model for text understanding, transformation, and code generation.

570K

10mo

Downloadable

gpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.

41.01M

7mo

Downloadable

gpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

8.46M

7mo

Free Endpoint

jamba-1.5-mini-instruct

Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.

571K

9mo

Free Endpoint

llama-3.1-nemotron-70b-reward

Leaderboard-topping reward model supporting RLHF for better alignment with human preferences.

438K

1y

Downloadable

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

16K

330K

9mo

Downloadable

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

15.84K

690K

9mo

Free Endpoint

mistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

567K

9mo

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

893K

9mo

Free Endpoint

nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.

536K

9mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

574K

1y

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory/compute-constrained environments

2.79M

9mo
