Try NVIDIA NIM APIs

Explore Models Blueprints GPUs Docs

|

|

Manage My Privacy

|

Copyright © 2025 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Publisher

Use Case

NIM Type

Sorting by Most Recent

deepseek-ai deepseek-r1-0528

Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.

coding chat math advanced reasoning deepseek-ai

nvidia llama-3.1-nemotron-nano-4b-v1.1

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

edge tool calling reasoning math nvidia

marin marin-8b-instruct

State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.

reasoning chat science open model math marin

ibm granite-3.3-8b-instruct

Small language model fine-tuned for improved reasoning, coding, and instruction-following

coding reasoning instruction following ibm

qwen qwen3-235b-a22b

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

complex math advanced reasoning instruction following qwen

mistralai mistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generation image-to-text multimodal visual question answering mistralai

nvidia llama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

math advanced reasoning instruction following function calling nvidia

qwen qwq-32b

Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.

coding chat math advanced reasoning qwen

nvidia bevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

autonomous vehicles bev automotive perception nvidia

nvidia llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

math advanced reasoning instruction following function calling nvidia

nvidia llama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

math advanced reasoning instruction following function calling nvidia

deepseek-ai deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillation coding chat reasoning run-on-rtx math deepseek-ai

google gemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistant visual question answering language generation image-to-text google

google gemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

translation chat chat text-to-text language generation google

deepseek-ai deepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

deepseek-ai deepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

deepseek-ai deepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

microsoft phi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

speech recognition visual qa language generation image-to-text chart and table understanding microsoft

mistralai mistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

code chat reasoning agent-centric multilingual mistralai

deepseek-ai deepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chat math advanced reasoning deepseek-ai

tiiuae falcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

coding chat code generation language generation improved reasoning math scientific knowledge tiiuae

qwen qwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generation chat text-to-text large language models qwen

qwen qwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completion code generation chat text-to-code qwen

meta llama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoning code generation text-to-text instruction following math meta

hive deepfake-image-detection

Advanced AI model detects faces and identifies deep fake images.

computer vision ai safety deep fake detection content moderation hive

zyphra zamba2-7b-instruct

Efficient hybrid state-space model designed for conversational and reasoning tasks.

chat chat language generation text-to-text zyphra

meta llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation meta

meta llama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image-to-text image captioning visual grounding meta

meta llama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrieval visual qa image captioning image-to-text visual grounding meta

meta llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chat code generation text-to-text language generation meta

qwen qwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generation chat chat text-to-text large language models qwen

microsoft phi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistant visual question answering language generation image-to-text microsoft

microsoft phi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moe chat code generation chat text-to-text language generation microsoft

rakuten rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat chat text-to-text language generation large language models rakuten

rakuten rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat chat text-to-text language generation large language models rakuten

google gemma-2-2b-it

Advanced small language generative AI model for edge applications

chat code generation chat text-to-text language generation google

meta llama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generation chat code generation meta

meta llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generation chat text-to-text language generation meta

meta llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

code generation chat text-to-text language generation run-on-rtx meta

nv-mistralai mistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

code generation chat language generation text-to-text run-on-rtx nv-mistralai

microsoft phi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

bigcode starcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

bigcode starcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completion code generation code generation bigcode

nvidia llama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text chat non-commercial use only chat nvidia

nvidia llama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text chat non-commercial use only nvidia

stabilityai stable-diffusion-3-medium

Advanced text-to-image model for generating high quality images

image generation text-to-image stabilityai

upstage solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use only chat text-to-text language generation large language models upstage

google codegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

chat code generation code completion google

microsoft phi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

image cv vision assistant vlm visual question answering computer vision language generation image-to-text video microsoft

microsoft phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chat code generation chat text-to-text language generation large language models microsoft

microsoft phi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chat code generation chat text-to-text language generation large language models microsoft

mistralai mixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai

meta llama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

chat large language models code generation chat text-to-text language generation meta

meta llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat code generation chat text-to-text language generation large language models meta

google gemma-2b

Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.

chat code generation chat text-to-text language generation google

mistralai mixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai