NVIDIA
Explore Models Blueprints GPUs Docs

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for developmentAccelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
DiscoverModelsBlueprintsGPUsDocsForums
models
ReasoningVisionVisual DesignRetrievalSpeechBiologySimulationClimate & WeatherSafety & ModerationRun on RTX
industries
AutomotiveGamingHealthcareIndustrialRobotics

Discover

Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Build with gpt-oss: OpenAI's Latest Open-Weight Reasoning Model

Try Now

Achieves near-parity with o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU.

Featured Models

View All

The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.

PREVIEW

deepseek-aideepseek-v3.1

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

reasoningtext-to-text
Run Anywhere

openaigpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

mathchatreasoningtext-to-text
PREVIEW

nvidianvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

mambaagenticnanoreasoningslmthinking budgetthroughput
PREVIEW

nvidiacosmos-reason1-7b

Reasoning vision language model (VLM) for physical AI and robotics.

physical aiautonomous vehiclesindustrialreasoningroboticssmart citiessynthetic data generationvideo understandingvision language model

Customize a Blueprint

View All

Get started with workflows and code samples to build AI applications from the ground up.

nvidiaBuild an AI Agent for Enterprise Research

Build a custom deep researcher powered by state-of-the-art models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.

blueprintllama nemotronnimnemo retrieverreasoningretrieval-augmented generationenterpriselaunchablenvidia ai

nvidiaBuild a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

blueprintenterpriselaunchablenvidia aichatgenerative aivideo-to-textvision

nvidiaBuild an Enterprise RAG pipeline

Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

blueprintnimnemo retrieverretrieval-augmented generationenterpriselaunchablenvidia ai

nvidiaSafety for Agentic AI

Improve safety, security, and privacy of AI systems at build, deploy and run stages.

blueprintnemo guardrailslaunchablenvidia aiopen modelsprivacysafetysecurity