Achieves near-parity with o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU.
Get started with workflows and code samples to build AI applications from the ground up.
Build a custom deep researcher powered by state-of-the-art models. It continuously processes and synthesizes multimodal enterprise data, using reasoning, planning, and refinement to generate comprehensive reports.
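The plan-gather-refine cycle described above can be sketched as a minimal loop. This is a hypothetical skeleton, not the blueprint's actual code: `plan`, `gather`, and `refine` are illustrative stubs that a real system would back with model calls and enterprise data retrieval.

```python
# Hypothetical deep-researcher skeleton: plan sub-questions, gather
# findings, refine them into a report. All three stages are stubs.
def plan(topic: str) -> list[str]:
    # A real planner would use an LLM; here we derive fixed sub-questions.
    return [f"What is {topic}?", f"Why does {topic} matter?"]

def gather(question: str) -> str:
    # Stub for retrieval and synthesis over enterprise data.
    return f"Finding for: {question}"

def refine(findings: list[str]) -> str:
    # Merge findings into one report; real systems iterate with critique.
    return "\n".join(findings)

def research(topic: str) -> str:
    questions = plan(topic)
    findings = [gather(q) for q in questions]
    return refine(findings)

report = research("mixture-of-experts")
print(report)
```

In a production agent, `refine` would loop back into `plan` when the draft report exposes gaps, which is the refinement step the description refers to.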
Ingest massive volumes of live or archived video and extract insights for summarization and interactive Q&A.
Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.
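The extract-embed-index-search flow can be illustrated with a toy in-memory index. In the actual RAG blueprint, embeddings come from NeMo Retriever models; the `embed` function below is a bag-of-words stand-in used only to show how indexing and cosine-similarity search fit together.

```python
# Minimal in-memory semantic search sketch. embed() is a toy
# bag-of-words embedding; real systems use dense model embeddings.
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy embedding: lowercase word counts.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class VectorIndex:
    def __init__(self):
        self.items = []  # list of (document, embedding) pairs

    def add(self, doc: str):
        self.items.append((doc, embed(doc)))

    def search(self, query: str, k: int = 1):
        q = embed(query)
        ranked = sorted(self.items, key=lambda it: cosine(q, it[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]

index = VectorIndex()
index.add("quarterly revenue report for the sales team")
index.add("employee onboarding checklist and HR policies")
print(index.search("sales revenue numbers"))
```

The "continuously extract" part of the pipeline corresponds to calling `add` as new documents arrive; production systems swap the linear scan for an approximate-nearest-neighbor index at scale.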
Improve the safety, security, and privacy of AI systems at the build, deploy, and run stages.
The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.
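NIM runtimes expose an OpenAI-compatible chat-completions API, so a request can be sketched as below. The endpoint URL and model name are illustrative placeholders, assuming a self-hosted deployment on its default port; substitute the values for your own deployment.

```python
# Sketch of a request to a NIM endpoint via its OpenAI-compatible
# chat-completions API. URL and model name are example placeholders.
import json
import urllib.request

NIM_URL = "http://localhost:8000/v1/chat/completions"  # example self-hosted NIM

def build_request(model: str, prompt: str, max_tokens: int = 256) -> dict:
    # OpenAI-style chat-completions payload.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": 0.2,
    }

def send(payload: dict) -> bytes:
    # Network call; requires a running NIM at NIM_URL.
    req = urllib.request.Request(
        NIM_URL,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read()

payload = build_request("meta/llama-3.1-8b-instruct", "Summarize MoE in one sentence.")
print(json.dumps(payload, indent=2))
```

Because the API surface matches OpenAI's, existing client libraries can be pointed at a NIM deployment by changing only the base URL.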
Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit on a single 80 GB GPU.
Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math.
High-efficiency LLM with a hybrid Transformer-Mamba design, excelling at reasoning and agentic tasks.
Reasoning vision language model (VLM) for physical AI and robotics.