NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

ModelsExplore Models
BlueprintsGet Started with Blueprints
GPUsLaunch a GPU Instance

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for developmentAccelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
DiscoverModelsBlueprintsGPUs
Docs
Forums
models
ReasoningVisionVisual DesignRetrievalSpeechBiologySimulationClimate & WeatherSafety & Moderation
industries
AutomotiveGamingHealthcareIndustrialRobotics

Reasoning

Developer Favorites

The top large language models for your enterprise AI

PREVIEW

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
PREVIEW

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
Run Anywhere

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

mathadvanced reasoningchat
Run Anywhere

nvidiallama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

advanced reasoningfunction callinginstruction followingmath

Combine Vision and Language Intelligence

Multimodal reasoning models

PREVIEW

microsoftkosmos-2

Groundbreaking multimodal model designed to understand and reason about visual elements in images.

image understandingmultimodalvisual question answeringcomputer visioncvimageimage-to-textvideovlm
PREVIEW

nvidianeva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

non-commercial use onlyvision assistantvisual question answeringcomputer visioncvimageimage-to-textvideovlm
PREVIEW

googledeplot

Translate images of plots into tables with one-shot visual language understanding.

multimodaldata ingestionnemo retrieverimage-to-text

Fresh Off the Press

The latest innovations in intelligence models

PREVIEW

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
PREVIEW

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answering
Run Anywhere

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

mathadvanced reasoningchat
Run Anywhere

metallama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrievalvisual groundingvisual qaimage captioningimage-to-text
Run Anywhere

metallama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image captioningimage-text retrievalvisual groundingvisual qaimage-to-text
Run Anywhere

metallama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chatlanguage generationtext-to-textcode generation
Run Anywhere

metallama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

language generationtext-to-textchatcode generation
Run Anywhere

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

chatsynthetic data generationcode generation
Run Anywhere

metallama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

chatlanguage generationtext-to-textcode generation
Run Anywhere

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

chatlanguage generationrun on rtxtext-to-textcode generation
PREVIEW

microsoftphi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

chatlanguage generationmoetext-to-textcode generation
PREVIEW

microsoftphi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

chatlanguage generationlarge language modelstext-to-textcode generation
PREVIEW

ai21labsjamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatlanguage generationtext-to-text
PREVIEW

ai21labsjamba-1.5-large-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatlanguage generationtext-to-text
PREVIEW

googlegemma-2-2b-it

Advanced small language generative AI model for edge applications

chatlanguage generationtext-to-textcode generation
PREVIEW

googleshieldgemma-9b

Guardrail model to ensure that responses from LLMs are appropriate and safe

guardrailtext-to-text
PREVIEW

writerpalmyra-fin-70b-32k

Specialized LLM for financial analysis, reporting, and data processing

financetext-to-text
PREVIEW

mistralaimamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completioncode generationcode generation
Run Anywhere

nv-mistralaimistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

chatcode generationlanguage generationtext-to-textrun on rtx
PREVIEW

rakutenrakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatlanguage generationlarge language modelstext-to-text
PREVIEW

rakutenrakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatlanguage generationlarge language modelstext-to-text
PREVIEW

microsoftphi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chatlanguage generationlarge language modelstext-to-textcode generation
PREVIEW

baichuan-incbaichuan2-13b-chat

Support Chinese and English chat, coding, math, instruction following, solving quizzes

chatchinese language generationtext-to-texttext translation
PREVIEW

thudmchatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

chatregional language generationtext-to-textcode generationtext translation
PREVIEW

writerpalmyra-med-70b

Leading LLM for accurate, contextually relevant responses in the medical domain.

healthcaretext-to-text
PREVIEW

writerpalmyra-med-70b-32k

Leading LLM for accurate, contextually relevant responses in the medical domain.

healthcaretext-to-text