NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: Non-Commerical Use Only
Sorting by Most Recent

mistralaimistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generationimage-to-textmultimodalvisual question answeringmistralai

nvidiaparakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

asrstreamingspeech-to-textmultilingualnvidia nimnvidia

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

image generationrun on rtxtext-to-imageblack-forest-labs

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answeringmeta

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answeringmeta

nvidiaAI Weather Analytics with Earth-2

Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.

climate scienceblueprintweather simulationai weather predictionnvidia aiearth-2nvidia

nvidiaSingle Cell Analysis

Investigate, understand, and interpret single cell data in minutes, not days by leveraging RAPIDS-singlecell, powered by NVIDIA RAPIDS

rapidsrna sequencingblueprintgenomicssingle cellbiologynvidia ainvidia

nvidiaGenomics Analysis

Easily run essential genomics workflows to save time leveraging Parabricks

parabricksblueprintgenomicsbiologydna sequencingnvidia ainvidia

nvidiacosmos-predict1-7b

Generates physics-aware video world states from text and image prompts for physical AI development.

synthetic data generationphysical airoboticstext-to-worldimage-to-worldnvidia

nvidiacosmos-predict1-5b

Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.

synthetic data generationphysical aipolicy evaluationroboticsvideo-to-worldnvidia

nvidiamagpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

ttstext-to-speechnvidia nimnvidia rivamultilingualnvidia

colabfoldmsa-search

Generates a multiple sequence alignment from a query sequence and a protein sequence database search.

nimbionemobiologydrug discoveryprotein foldingcolabfold

openfoldopenfold2

Predicts the 3D structure of a protein from its amino acid sequence, multiple sequence alignments, and templates.

biologynimbionemodrug discoveryprotein foldingopenfold

googlegemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textgoogle

microsoftphi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationmicrosoft

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

speech recognitionvisual qalanguage generationimage-to-textchart and table understandingmicrosoft

arcevo2-40b

Evo 2 is a biological foundation model that is able to integrate information over long genomic sequences while retaining sensitivity to single-nucleotide changes.

dna generationbiologynimbionemodrug discoveryarc

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

asrastspeech-to-textbatchwhisperopenaimultilingualnvidia nimnvidia rivaopenai

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asraststreamingspeech-to-textbatchspanishmultilingualnvidia nimnvidia rivanvidia

nvidiacanary-0.6b-turbo-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asrastfastspeech-to-textbatchmultilingualnvidia nimnvidia rivanvidia

igeniuscolosseum_355b_instruct_16k

NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry

heavy industrygovernmenthighly regulated use case supportfinancial servicesigenius

tiiuaefalcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

codingcode generationlanguage generationimproved reasoningmathscientific knowledgetiiuae

igeniusitalia_10b_instruct_16k

Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry

heavy industrygovernmenthighly regulated use case supportfinancial servicesigenius

nvidiaBuild A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

nvidia bionemoblueprintbionemobiologydrug discoveryprotein generationnvidia

nvidiagenmol

Fragment-Based Molecular Generation by Discrete Diffusion.

chemistrynimbionemomolecule generationdrug discoverynvidia

nvidiaPDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

blueprintmulti-modaltext-to-speechconversational aipdf-to-podcastnvidia aiai agenttext-to-speechnvidia

nvidiallama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

nemo retrieverrun on rtxembeddingretrieval augmented generationtext-to-embeddingnvidia

nvidiallama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

nemo retrieverretrieval augmented generationrerankingnvidia

nvidiausdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

openusdsynthetic data generationdigital twincode generationchatnvidia nimnvidia

metallama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoningcode generationtext-to-textinstruction followingmathmeta

nvidiaaudio2face-3d

Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.

speech-to-animationdigital humansaudio-to-facenvidia nimnvidia

nvidiaconformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case English with record-setting accuracy and performance

asrstreamingspeech-to-textspanishnvidia nimnvidia rivanvidia

nvidiacorrdiff

Generative downscaling model for generating high resolution regional scale weather fields.

ai weather predictionweather simulationearth-2nvidia

nvidiafourcastnet

FourCastNet predicts global atmospheric dynamics of various weather / climate variables.

weather simulationai weather predictionclimate scienceearth-2nvidia

nvidiallama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

code generationchattext-to-textlanguage generationnvidia

nvidiamistral-nemo-minitron-8b-8k-instruct

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

small language modelcode generationchattext-to-textlanguage generationnvidia

metallama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationmeta

metallama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage-to-textimage captioningvisual groundingmeta

metallama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage captioningimage-to-textvisual groundingmeta

metallama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationmeta

deepmindalphafold2-multimer

Predicts the 3D structure of a protein from its amino acid sequence.

nimbionemobiologyprotein foldingdrug discoverydeepmind

nvidiaconsistory

Generates consistent characters across a series of images without requiring additional training.

image generationtext-to-imagenvidia

metaesm2-650m

Generates embeddings of proteins from their amino acid sequences.

nimprotein embeddingbionemobiologydrug discoverymeta

deepmindalphafold2

Predicts the 3D structure of a protein from its amino acid sequence.

nimbionemobiologyprotein foldingdrug discoverydeepmind

yentinglinllama-3-taiwan-70b-instruct

Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.

regional language generationchatcode generationlarge language modelsyentinglin

nvidiaBuild A Generative Virtual Screening Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.

chemistrynimnvidia bionemoblueprintbionemodockingdrug discoverynvidia

microsoftphi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textmicrosoft

microsoftphi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moecode generationchattext-to-textlanguage generationmicrosoft

microsoftphi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

nvidianv-dinov2

NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.

image-to-embeddingcomputer visiondeepstreamnvidia nimobject classificationnvidia

nvidianv-grounding-dino

Grounding dino is an open vocabulary zero-shot object detection model.

object detectioncomputer visiondeepstreamnvidia nimnvidia

briaaiBRIA-2.3

An enterprise-grade text-to-image model trained on a compliant dataset produces high quality images.

image generationtext-to-imagebriaai

nvidiaradtts-hifigan-tts

Natural, high-fidelity, English voices for personalizing text-to-speech services and voiceovers

text-to-speechtext-to-speechnvidia nimnvidia

nvidiamegatron-1b-nmt

Enable smooth global interactions in 36 languages.

text translationneural machine translationnvidia nimnvidia

nvidiafastpitch-hifigan-tts

Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots

text-to-speechnvidia nimnvidia

nvidiaparakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

asrstreamingenglishspeech-to-textbatchnvidia nimnvidia

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

asrstreamingenglishbatchrun on rtxspeech-to-textfastnvidia nimnvidia

ipdproteinmpnn

ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.

biologynimbionemodrug discoveryprotein generationipd

microsoftflorence-2

Vision foundation model capable of performing diverse computer vision and vision language tasks.

image classificationimageobject detectioncvmultimodalvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-texttext-to-imagemicrosoft

googlegemma-2-2b-it

Advanced small language generative AI model for edge applications

code generationchattext-to-textlanguage generationgoogle

nvidiausdsearch

AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.

openusdsynthetic data generationdigital twinusdtext-to-3dnvidia nimnvidia

nvidiaaudio2face-2d

Create facial animations using a portrait photo and synchronize mouth movement with audio.

speech-to-animationtelepresencenvidia maxinedigital humannvidia

nvidiausdvalidate

Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.

validationopenusdsynthetic data generationdigital twinusdvisualization 3dnvidia

mistralaimamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completioncode generationcode generationmistralai

baichuan-incbaichuan2-13b-chat

Support Chinese and English chat, coding, math, instruction following, solving quizzes

chinese language generationtext translationchattext-to-textbaichuan-inc

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generationchatcode generationmeta

metallama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generationchattext-to-textlanguage generationmeta

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

run on rtxcode generationchattext-to-textlanguage generationmeta

nv-mistralaimistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

run on rtxcode generationchatlanguage generationtext-to-textnv-mistralai

nvidianv-rerankqa-mistral-4b-v3

Multilingual text reranking model.

nemo retrieverrerankingretrieval augmented generationnvidia

nvidianv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

embeddingretrieval augmented generationnemo retrievertext-to-embeddingnvidia

nvidiamaisi

MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.

image generationmedical imagingnvidia nimnvidia

microsoftphi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

bigcodestarcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

bigcodestarcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

googlegemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

code generationchattext-to-textlanguage generationgoogle

googlegemma-2-9b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

chatcode generationtext-to-textlanguage generationgoogle

nvidiallama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textnon-commercial use onlychatnvidia

nvidiallama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textnon-commercial use onlychatnvidia

01-aiyi-large

Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.

code generationchattext-to-textmultilingual01-ai

stabilityaistable-diffusion-3-medium

Advanced text-to-image model for generating high quality images

image generationtext-to-imagestabilityai

nvidiaocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

optical character recognitionimageoptical character detectioncvvlmcomputer visiontao toolkitvideonvidia

nvidianv-embed-v1

Generates high-quality numerical embeddings from text inputs.

non-commercial use onlyretrieval augmented generationtext-to-embeddingnvidia

upstagesolar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use onlychattext-to-textlanguage generationlarge language modelsupstage

baaibge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

embeddingsretrieval augmented generationtext-to-embeddingbaai

nvidiavisual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

imageimage generationcvimage segmentationvlmcomputer visiontao toolkitvideonvidia nimnvidia

googlecodegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

code generationcode completiongoogle

ibmgranite-34b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

code generationchatlarge language modelstext-to-codeibm

ibmgranite-8b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

code generationchatlarge language modelstext-to-codeibm

nvidiaretail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

object detectionimagecvvlmcomputer visiontao toolkitvideonvidia nimnvidia

ipdrfdiffusion

A generative model of protein backbones for protein binder design.

biologynimbionemodrug discoveryprotein generationipd

microsoftphi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideomicrosoft

googlepaligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideogoogle

microsoftphi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

snowflakearctic-embed-l

Optimized community model for text embedding.

nemo retrieverembeddingretrieval augmented generationtext-to-embeddingsnowflake

microsoftphi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

mistralaimixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningcode generationchattext-to-textlarge language modelsmistralai

metallama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

large language modelscode generationchattext-to-textlanguage generationmeta

metallama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationlarge language modelsmeta

googlerecurrentgemma-2b

Novel recurrent architecture based language model for faster inference when generating long sequences.

code generationchattext-to-textlanguage generationgoogle

googlecodegemma-7b

Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

code generationchatlanguage generationtext-to-codegoogle

googlegemma-2b

Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.

code generationchattext-to-textlanguage generationgoogle

nvidiaembed-qa-4

GPU-accelerated generation of text embeddings used for question-answering retrieval.

embeddingsretrieval augmented generationtext-to-embeddingnvidia

nvidiarerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

rankingretrieval augmented generationnvidia

stabilityaistable-diffusion-xl

Generate images and stunning visuals with realistic aesthetics.

image generationtext-to-imagestabilityai

microsoftkosmos-2

Groundbreaking multimodal model designed to understand and reason about visual elements in images.

imagecvmultimodalvlmvisual question answeringcomputer visionimage understandingimage-to-textvideomicrosoft

googledeplot

Translate images of plots into tables with one-shot visual language understanding.

nemo retrievermultimodaldata ingestionimage-to-textgoogle

nvidianeva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

imagecvvision assistantnon-commercial use onlyvlmvisual question answeringcomputer visionimage-to-textvideonvidia

adeptfuyu-8b

Multi-modal model for a wide range of tasks, including image understanding and language generation.

imagecvmultimodalvlmcomputer visionimage understandinglanguage generationimage-to-textvideoadept

nvidiavista-3d

VISTA-3D is a specialized interactive foundation model for segmenting and anotating human anatomies.

interactive annotationimage segmentationnon-commercial use onlymedical imagingnvidia

googlegemma-7b

Cutting-edge text generation model text understanding, transformation, and code generation.

code generationchattext-to-textlanguage generationgoogle

nvidiafq2bam

Generate BAM output given one or more pairs of FASTQ files, by running BWA-MEM & GATK best practices.

parabricksgenomicsdna sequencingnvidia

nvidiadeepvariant

Run Google's DeepVariant optimized for GPU. Switch models for high accuracy on all major sequencers.

parabricksgenomicsdna sequencingnvidia

stabilityaistable-video-diffusion

Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.

image generationtext-to-imagestabilityai

stabilityaisdxl-turbo

A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation

image generationtext-to-imagestabilityai

nvidiamolmim

MolMIM performs controlled generation, finding molecules with the right properties.

chemistrynimbionemomolecule generationdrug discoverynvidia

mitdiffdock

Predicts the 3D structure of how a molecule interacts with a protein.

chemistrynimbionemodockingdrug discoverymit

mistralaimixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningcode generationchattext-to-textlarge language modelsmistralai

nvidiacuopt

World-record accuracy and performance for complex route optimization.

route optimizationnvidia