
A personal Grace Blackwell AI supercomputer on your desk.
The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.

80B-parameter AI model with hybrid reasoning, an MoE architecture, and support for 119 languages.

Excels at agentic coding and browser use, supports a 256K context window, and delivers top results.

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math.

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
Get started with workflows and code samples to build AI applications from the ground up.
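
For instance, a first call to a hosted model can be a few lines of Python. The sketch below is a minimal, illustrative example rather than a prescribed workflow: the endpoint URL, model ID, and NVIDIA_API_KEY environment variable are assumptions to be swapped for the values of whichever model you actually deploy.

```python
# Minimal sketch: chat with a hosted model through an OpenAI-compatible
# endpoint. The base URL, model ID, and NVIDIA_API_KEY variable are
# illustrative assumptions -- substitute the values for your deployment.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",  # assumed hosted endpoint
    api_key=os.environ["NVIDIA_API_KEY"],            # assumed environment variable
)

response = client.chat.completions.create(
    model="nvidia/llama-3.1-nemotron-70b-instruct",  # placeholder model ID
    messages=[
        {"role": "system", "content": "You are a concise technical assistant."},
        {"role": "user", "content": "What trade-offs does a hybrid Transformer-Mamba design make?"},
    ],
    temperature=0.2,
    max_tokens=300,
)

print(response.choices[0].message.content)
```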

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A.

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
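
To make the retrieval step concrete, the sketch below embeds a few documents, ranks them against a query by cosine similarity, and passes the best match to a chat model as context. It is a rough illustration under assumed names (endpoint, model IDs, API-key variable), not the blueprint itself, which builds on NeMo Retriever and Nemotron models as noted above.

```python
# Minimal retrieval-augmented generation sketch. Endpoint, model IDs, and
# the NVIDIA_API_KEY variable are illustrative assumptions; the actual RAG
# Blueprint is built on NeMo Retriever and Nemotron models.
import os
import numpy as np
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",   # assumed endpoint
    api_key=os.environ["NVIDIA_API_KEY"],             # assumed environment variable
)

docs = [
    "Q3 revenue grew 12% year over year, driven by data center sales.",
    "The new policy requires a security review before production deployment.",
    "Grace Blackwell pairs an Arm CPU with a Blackwell GPU in one module.",
]

def embed(texts):
    # Placeholder embedding model ID; any OpenAI-compatible embedder works here.
    out = client.embeddings.create(model="nvidia/nv-embedqa-e5-v5", input=texts)
    return np.array([d.embedding for d in out.data])

doc_vecs = embed(docs)
query = "What does the Grace Blackwell module combine?"
q_vec = embed([query])[0]

# Rank documents by cosine similarity to the query and keep the best match.
scores = doc_vecs @ q_vec / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q_vec))
context = docs[int(scores.argmax())]

answer = client.chat.completions.create(
    model="nvidia/llama-3.1-nemotron-70b-instruct",   # placeholder model ID
    messages=[
        {"role": "system", "content": "Answer using only the provided context."},
        {"role": "user", "content": f"Context: {context}\n\nQuestion: {query}"},
    ],
)
print(answer.choices[0].message.content)
```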

Improve the safety, security, and privacy of AI systems across the build, deploy, and run stages.