NVIDIA

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices.


Featured Models

Download Available

z-ai/glm5

GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.

Agentic, MoE, Reasoning
Download Available

nvidia/llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

embedding, nemo retriever, Retrieval Augmented Generation, Text-to-Embedding
Download Available

moonshotai/kimi-k2.5

1T multimodal MoE for high‑capacity video and image understanding with efficient inference.

Mixture-of-Experts, Multimodal, Reasoning, Image-to-Text
Download Available

nvidia/cosmos-reason2-8b

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

Physical AI, autonomous vehicles, industrial, reasoning, robotics, smart cities, Synthetic Data Generation, video understanding, vision language model
Download Available

nvidia/nemoretriever-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Chart Detection, Object Detection, Table Detection, data ingestion, nemo retriever
Download Available

nvidia/nemotron-3-nano-30b-a3b

Open, efficient MoE model with 1M context that excels at coding, reasoning, instruction following, tool calling, and more.

Instruction Following, Long Context, MoE, Reasoning
Download Available

nvidia/nemotron-parse

Cutting-edge vision-language model that excels at retrieving text and metadata from images.

document parsing, supported language - english, text and table extraction
Download Available

nvidia/nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generation, Image-to-Text, vision assistant, visual question answering
Download Available

openfold/openfold3

OpenFold3 is a third-generation biomolecular foundation model that predicts the three-dimensional structures of molecular complexes (proteins, DNA, RNA, ligands).

Biology, Protein Folding, Drug Discovery
Download Available

nvidia/parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Taiwanese Mandarin and English transcription.

ASR, NVIDIA NIM, Streaming, Taiwanese, Speech-to-Text