NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Publisher
Use Case
NIM Type
Sorting by Most Recent

gotocompanygemma-2-9b-cpt-sahabatai-instruct

SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.

sovereign aichatindonesianchattext-to-textregional language generationgotocompany

googlegemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

translationchatchattext-to-textlanguage generationgoogle

microsoftphi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationmicrosoft

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chatmathadvanced reasoningdeepseek-ai

nvidiallama-3.1-nemoguard-8b-topic-control

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

dialogue safetyllm safetyguard modelcontent safetynvidia

nvidianemoguard-jailbreak-detect

Industry leading jailbreak classification model for protection from adversarial attempts

llm securityjailbreak detectionprompt injectionnvidia nimnvidia

nvidiallama-3.1-nemoguard-8b-content-safety

Leading content safety model for enhancing the safety and moderation capabilities of LLMs

llm safetycontent moderationguard modelcontent safetynvidia

igeniuscolosseum_355b_instruct_16k

NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry

heavy industrygovernmentchathighly regulated use case supportfinancial servicesigenius

tiiuaefalcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

codingchatcode generationlanguage generationimproved reasoningmathscientific knowledgetiiuae

igeniusitalia_10b_instruct_16k

Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry

heavy industrygovernmentchathighly regulated use case supportfinancial servicesigenius

qwenqwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchattext-to-textlarge language modelsqwen

nvidiacosmos-nemotron-34b

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlmvision language modelimage captionimage to textnvidia

qwenqwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completioncode generationchattext-to-codeqwen

writerpalmyra-creative-122b

Powerful LLM designed for creative thinking and writing.

content generationchatchattext-to-textwriter

nvidiausdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

openusdsynthetic data generationdigital twincode generationchatnvidia nimnvidia

metallama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoningcode generationtext-to-textinstruction followingmathmeta

nvidianemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

indicchatchattext-to-textlanguage generationnvidia

nvidiallama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

chatcode generationchattext-to-textlanguage generationnvidia

qwenqwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchatchattext-to-textlarge language modelsqwen

nvidiavila

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlmvision language modelimage captionimage to textnvidia

tokyotech-llmllama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language modelchatregional language generationtokyotech-llm

ai21labsjamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatchatlanguage generationtext-to-textai21labs

ai21labsjamba-1.5-large-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatchatlanguage generationtext-to-textai21labs

nvidianemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

chattext-to-textlanguage generationnvidia

microsoftphi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moechatcode generationchattext-to-textlanguage generationmicrosoft

microsoftphi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

rakutenrakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatchattext-to-textlanguage generationlarge language modelsrakuten

rakutenrakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatchattext-to-textlanguage generationlarge language modelsrakuten

microsoftflorence-2

Vision foundation model capable of performing diverse computer vision and vision language tasks.

image classificationimageobject detectioncvmultimodalvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-texttext-to-imagemicrosoft

writerpalmyra-fin-70b-32k

Specialized LLM for financial analysis, reporting, and data processing

chatfinancetext-to-textwriter

googleshieldgemma-9b

Guardrail model to ensure that responses from LLMs are appropriate and safe

guardrailtext-to-textgoogle

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generationchatcode generationmeta

nvidiallama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textchatnon-commercial use onlychatnvidia

nvidiallama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textchatnon-commercial use onlynvidia

mistralaimistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

chattext-to-textlanguage generationmistralai

nvidiaocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

optical character recognitionimageoptical character detectioncvvlmcomputer visiontao toolkitvideonvidia

writerpalmyra-med-70b-32k

Leading LLM for accurate, contextually relevant responses in the medical domain.

chattext-to-texthealthcarewriter

writerpalmyra-med-70b

Leading LLM for accurate, contextually relevant responses in the medical domain.

chattext-to-texthealthcarewriter

mediatekbreeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

chatchattext-to-textregional language generationmediatek

nvidiavisual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

imageimage generationcvimage segmentationvlmcomputer visiontao toolkitvideonvidia nimnvidia

ibmgranite-34b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

chatcode generationchatlarge language modelstext-to-codeibm

ibmgranite-8b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

chatcode generationchatlarge language modelstext-to-codeibm

nvidiaretail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

object detectionimagecvvlmcomputer visiontao toolkitvideonvidia nimnvidia

microsoftphi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideomicrosoft

googlepaligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideogoogle

aisingaporesea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

chattext-to-textregional language generationlarge language modelsaisingapore

microsoftphi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

databricksdbrx-instruct

A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.

chatchattext-to-textlanguage generationlarge language modelsdatabricks

microsoftphi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatcode generationchattext-to-textlanguage generationlarge language modelsmicrosoft

mistralaimixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningchatcode generationchattext-to-textlarge language modelsmistralai

metallama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatcode generationchattext-to-textlanguage generationlarge language modelsmeta

microsoftkosmos-2

Groundbreaking multimodal model designed to understand and reason about visual elements in images.

imagecvmultimodalvlmvisual question answeringcomputer visionimage understandingimage-to-textvideomicrosoft

nvidianeva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

imagecvvision assistantnon-commercial use onlyvlmvisual question answeringcomputer visionimage-to-textvideonvidia

adeptfuyu-8b

Multi-modal model for a wide range of tasks, including image understanding and language generation.

imagecvmultimodalvlmcomputer visionimage understandinglanguage generationimage-to-textvideoadept

mistralaimistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

chattext-to-textlanguage generationnvidia nimmistralai

mistralaimixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningchatcode generationchattext-to-textlarge language modelsmistralai