NVIDIA
Explore Models Blueprints GPUs Docs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for developmentAccelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
DiscoverModelsBlueprintsGPUsDocsForums
models
ReasoningVisionVisual DesignRetrievalSpeechBiologySimulationClimate & WeatherSafety & Moderation
industries
AutomotiveGamingHealthcareIndustrialRobotics

Automotive

Explore NVIDIA Automotive Blueprints

Comprehensive reference workflows that accelerate automotive software development with NVIDIA acceleration libraries, APIs, NIM, and microservices.

Enterprise

nvidiaBuild a Digital Human

Create intelligent, interactive avatars for customer service across industries

audio-to-faceblueprintchatdigital humansspeech-to-textenterprisenvidia ainvidia omniverse
Enterprise

nvidiaBuild an Enterprise RAG pipeline

Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

blueprintnimnemo retrieverretrieval-augmented generationenterpriselaunchablenvidia ai

Enterprise AI Models for Cloud Deployment

Leverage advanced AI models to streamline automotive software development and optimize cloud deployment.

nvidiacosmos-transfer1-7b

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

autonomous vehiclesphysical airoboticssynthetic data generationvideo-to-world
PREVIEW

nvidiavila

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlmvision language modelimage captionimage to text
Run Anywhere

nvidiacosmos-predict1-7b

Generalist model to generate future world state as videos from text and image prompts to create synthetic training data for robots and autonomous vehicles.

physical aiautonomous vehiclesimage-to-worldroboticstext-to-worldsynthetic data generation
PREVIEW

nvidiacosmos-predict1-5b

Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.

physical aipolicy evaluationroboticssynthetic data generationvideo-to-world
Run Anywhere

nvidianvclip

NV-CLIP is a multimodal embeddings model for image and text.

computer visionnvidia nimrun-on-rtxmultimodal embeddingstext and image
Run Anywhere

metallama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

instruction followingmathreasoningtext-to-textcode generation
Run Anywhere

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

chatsynthetic data generationcode generation
Run Anywhere

qwenqwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationlarge language modelstext-to-textchat
Run Anywhere

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

mathadvanced reasoningchat

AI Models for In-Vehicle Applications

Leverage AI models to improve embedded software for automotive applications, boosting efficiency and accelerating deployment.

PREVIEW

nvidiabevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

automotiveautonomous vehiclesbevperception
PREVIEW

nvidiasparsedrive

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

automotiveautonomous vehiclesav stackbev