NVIDIA
Explore Models Blueprints GPUs Docs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Publisher
Use Case
NIM Type
Sorting by Most Recent

deepseek-aideepseek-r1-0528

Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.

codingchatmathadvanced reasoningdeepseek-ai

nvidiallama-3.1-nemotron-nano-4b-v1.1

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

edgetool callingreasoningmathnvidia

marinmarin-8b-instruct

State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.

reasoningchatscienceopen modelmathmarin

ibmgranite-3.3-8b-instruct

Small language model fine-tuned for improved reasoning, coding, and instruction-following

codingreasoninginstruction followingibm

qwenqwen3-235b-a22b

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

complex mathadvanced reasoninginstruction followingqwen

mistralaimistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generationimage-to-textmultimodalvisual question answeringmistralai

nvidiallama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

mathadvanced reasoninginstruction followingfunction callingnvidia

qwenqwq-32b

Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.

codingchatmathadvanced reasoningqwen

nvidiabevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

autonomous vehiclesbevautomotiveperceptionnvidia

nvidiallama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

mathadvanced reasoninginstruction followingfunction callingnvidia

nvidiallama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

mathadvanced reasoninginstruction followingfunction callingnvidia

deepseek-aideepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillationcodingchatreasoningrun-on-rtxmathdeepseek-ai

googlegemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textgoogle

googlegemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

translationchatchattext-to-textlanguage generationgoogle

deepseek-aideepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationchatreasoningmathdeepseek-ai

deepseek-aideepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationchatreasoningmathdeepseek-ai

deepseek-aideepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationchatreasoningmathdeepseek-ai

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

speech recognitionvisual qalanguage generationimage-to-textchart and table understandingmicrosoft

mistralaimistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

codechatreasoningagent-centricmultilingualmistralai

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chatmathadvanced reasoningdeepseek-ai

tiiuaefalcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

codingchatcode generationlanguage generationimproved reasoningmathscientific knowledgetiiuae

qwenqwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchattext-to-textlarge language modelsqwen

qwenqwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completioncode generationchattext-to-codeqwen

metallama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoningcode generationtext-to-textinstruction followingmathmeta

hivedeepfake-image-detection

Advanced AI model detects faces and identifies deep fake images.

computer visionai safetydeep fake detectioncontent moderationhive

zyphrazamba2-7b-instruct

Efficient hybrid state-space model designed for conversational and reasoning tasks.

chatchatlanguage generationtext-to-textzyphra

metallama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chatcode generationchattext-to-textlanguage generationmeta

metallama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage-to-textimage captioningvisual groundingmeta

metallama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage captioningimage-to-textvisual groundingmeta

metallama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chatcode generationtext-to-textlanguage generationmeta

qwenqwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchatchattext-to-textlarge language modelsqwen

microsoftphi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textmicrosoft

microsoftphi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moechatcode generationchattext-to-textlanguage generationmicrosoft

rakutenrakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatchattext-to-textlanguage generationlarge language modelsrakuten

rakutenrakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatchattext-to-textlanguage generationlarge language modelsrakuten

googlegemma-2-2b-it

Advanced small language generative AI model for edge applications

chatcode generationchattext-to-textlanguage generationgoogle

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generationchatcode generationmeta

metallama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generationchattext-to-textlanguage generationmeta

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationrun-on-rtxmeta

nv-mistralaimistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

code generationchatlanguage generationtext-to-textrun-on-rtxnv-mistralai

microsoftphi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

bigcodestarcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

bigcodestarcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

nvidiallama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textchatnon-commercial use onlychatnvidia

nvidiallama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textchatnon-commercial use onlynvidia

stabilityaistable-diffusion-3-medium

Advanced text-to-image model for generating high quality images

image generationtext-to-imagestabilityai

upstagesolar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use onlychattext-to-textlanguage generationlarge language modelsupstage

googlecodegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

chatcode generationcode completiongoogle

microsoftphi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideomicrosoft

microsoftphi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

mistralaimixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningchatcode generationchattext-to-textlarge language modelsmistralai

metallama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

chatlarge language modelscode generationchattext-to-textlanguage generationmeta

metallama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatcode generationchattext-to-textlanguage generationlarge language modelsmeta

googlegemma-2b

Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.

chatcode generationchattext-to-textlanguage generationgoogle

mistralaimixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningchatcode generationchattext-to-textlarge language modelsmistralai