NVIDIA

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-host on your GPU infrastructure
Continuous vulnerability fixes

Copyright © 2025 NVIDIA Corporation

Build with gpt-oss: OpenAI's Latest Open-Weight Reasoning Model

Achieves near-parity with o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU.

Customize a Blueprint

Get started with workflows and code samples to build AI applications from the ground up.

nvidia / Build an AI Agent for Enterprise Research

Build a research agent powered by reasoning models that continuously processes and synthesizes multimodal enterprise data, using reasoning, planning, and refinement to generate comprehensive reports.

blueprint, llama nemotron, nim, nemo retriever, reasoning, retrieval-augmented generation, enterprise, launchable, nvidia ai

nvidia / Build a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived video and extract insights for summarization and interactive Q&A.

blueprint, enterprise, launchable, nvidia ai, chat, generative ai, video-to-text, vision

nvidia / Build an Enterprise RAG Pipeline

Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

blueprint, nim, nemo retriever, retrieval-augmented generation, enterprise, launchable, nvidia ai

nvidia / Safety for Agentic AI

Improve the safety, security, and privacy of AI systems at the build, deploy, and run stages.

blueprint, nemo guardrails, launchable, nvidia ai, open models, privacy, safety, security

Featured Models

The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.

Run Anywhere

openai/gpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit on a single 80 GB GPU.

math, chat, reasoning, text-to-text

Run Anywhere

openai/gpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math.

math, chat, reasoning, text-to-text

PREVIEW

nvidia/llama-3.3-nemotron-super-49b-v1.5

High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

advanced reasoning, function calling, instruction following, math

PREVIEW

moonshotai/kimi-k2-instruct

State-of-the-art open Mixture of Experts (MoE) model with strong reasoning, coding, and agentic capabilities.

advanced reasoning, agentic, coding, chat