NVIDIA
Explore
Models
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: visual design
Sorting by Most Recent

nvidianemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generationchatImage-to-Textvision assistantvisual question answering

nvidianvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

thinking budgetchatreasoning

openaigpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.

text-to-textchatreasoningmath

googlegemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

googlegemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generationspeech recognitionVisual QAchat

nvidiaBuild Digital Twins for AI Factory Design and Operations

Design, test, and optimize a new generation of intelligence manufacturing data centers using digital twins.

AI FactoryIndustrialNVIDIA OmniverseBlueprintsimulationEnterprise

mistralaimistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generationchatImage-to-Textmultimodalvisual question answering

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationchatImage-to-Textvision assistantvisual question answering

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationchatImage-to-Textvision assistantvisual question answering

nvidiaAI Weather Analytics with Earth-2

Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.

BlueprintClimate ScienceEnterpriseWeather SimulationAI Weather PredictionNVIDIA AIEarth-2

cadencespectre-x

Run large-scale electronics and chip design verification simulations

chip-designelectronic-design-automationedasemiconductorintegrated-circuitsdesign-verificationsimulations

googlegemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

Vision AssistantchatVisual Question AnsweringLanguage GenerationImage-to-Text

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Speech RecognitionVisual QAchatLanguage GenerationImage-to-TextChart and Table Understanding

nvidiaEvo 2 Protein Design

This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.

blueprintNIMbiologyBioNemoDrug DiscoveryProtein Generation

igeniuscolosseum_355b_instruct_16k

NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry

Heavy industryGovernmentchatHighly regulated use case supportFinancial services

nvidiaBuild A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

NVIDIA BioNemoBlueprintEnterpriseBioNemoBiologyDrug DiscoveryProtein Generation

nvidia3D Conditioning for Precise Visual Generative AI

Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.

visual designNVIDIA OmniverseBlueprintsimulationEnterprise

metallama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

Image-Text RetrievalVisual QAchatImage-to-TextImage CaptioningVisual Grounding

metallama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

Image-Text RetrievalVisual QAimage captioningchatImage-to-TextVisual Grounding

nvidiaBuild A Generative Virtual Screening Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.

ChemistryNIMNVIDIA BioNemoBlueprintEnterpriseBioNemoDockingDrug Discovery

microsoftphi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

Vision AssistantVisual Question AnsweringLanguage GenerationImage-to-Text

ai21labsjamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatLanguage GenerationText-to-text

nvidianv-dinov2

NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.

Image-to-Embeddingcomputer visiondeepstreamNVIDIA NIMobject Classification

nvidiausdvalidate

Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.

ValidationOpenUSDSynthetic Data GenerationDigital TwinUSDVisualization 3D

nvidiaocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

Optical Character RecognitionimageOptical Character Detectioncvvlmcomputer visionTAO Toolkitvideo

nvidiavisual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

imageImage GenerationcvImage Segmentationvlmcomputer visionTAO ToolkitvideoNVIDIA NIM

ipdrfdiffusion

A generative model of protein backbones for protein binder design.

biologynimBioNemoDrug DiscoveryProtein Generation

googlepaligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

imagecvVision AssistantvlmVisual Question Answeringcomputer visionLanguage GenerationImage-to-Textvideo