NVIDIA

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: Run on rtx
Sorting by Most Recent

nvidia / parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

asr, streaming, speech-to-text, multilingual, nvidia nim, nvidia

nvidia / 3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

blueprint, run on rtx, nvidia ai, nvidia

black-forest-labs / FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

image generation, run on rtx, text-to-image, black-forest-labs

qwen / qwq-32b

Powerful reasoning model capable of thinking and reasoning; it can achieve significantly enhanced performance in downstream tasks, especially on hard problems.

coding, chat, math, advanced reasoning, qwen

nvidia / Build an AI Agent for Research and Reporting

Create AI agents that reason, plan, reflect and refine to produce high-quality reports based on source materials of your choice.

nim, llama nemotron, reasoning, blueprint, retrieval-augmented generation, nvidia ai, nemo retriever, nvidia

nvidia / Single Cell Analysis

Investigate, understand, and interpret single-cell data in minutes, not days, by leveraging RAPIDS-singlecell, powered by NVIDIA RAPIDS

rapids, rna sequencing, blueprint, genomics, single cell, biology, nvidia ai, nvidia

nvidia / Genomics Analysis

Easily run essential genomics workflows to save time leveraging Parabricks

parabricks, blueprint, genomics, biology, dna sequencing, nvidia ai, nvidia

siemens / simcenter-star-ccm+

Run computational fluid dynamics (CFD) simulations

aerodynamics, cae, fluid-dynamics, simulation, heat-transfer, computer-aided engineering, siemens

cadence / spectre-x

Run large-scale electronics and chip design verification simulations

chip-design, electronic-design-automation, eda, semiconductor, integrated-circuits, design-verification, simulations, cadence

cadence / fidelity

Run computational fluid dynamics (CFD) simulations

aerodynamics, cae, fluid-dynamics, simulation, heat-transfer, computer-aided engineering, cadence

ansys / fluent

Run computational fluid dynamics (CFD) simulations

aerodynamics, cae, fluid-dynamics, simulation, heat-transfer, computer-aided engineering, ansys

nvidia / Synthetic Manipulation Motion Generation for Robotics

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

nvidia omniverse, blueprint, synthetic data, robotics, physical ai, robot learning, humanoids, nvidia isaac gr00t, text-to-world, image-to-world, teleop, nvidia

nvidia / cosmos-predict1-5b

Generates future frames of a physics-aware world state based simply on an image or short video prompt, for physical AI development.

synthetic data generation, physical ai, policy evaluation, robotics, video-to-world, nvidia

nvidia / Test Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

industrial, nvidia omniverse, blueprint, simulation, omniverse blueprint, nvidia

nvidia / bevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

autonomous vehicles, bev, automotive, perception, nvidia

nvidia / magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

tts, text-to-speech, nvidia nim, nvidia riva, multilingual, nvidia

deepseek-ai / deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillation, coding, run on rtx, reasoning, math, deepseek-ai

nvidia / nemoretriever-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detection, chart detection, nemo retriever, table detection, data ingestion, nvidia

nvidia / nemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detection, chart detection, nemo retriever, table detection, data ingestion, nvidia

nvidia / nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detection, chart detection, nemo retriever, table detection, data ingestion, nvidia

openfold / openfold2

Predicts the 3D structure of a protein from its amino acid sequence, multiple sequence alignments, and templates.

biology, nim, bionemo, drug discovery, protein folding, openfold

google / gemma-3-27b-it

Cutting-edge open multimodal model excelling in high-quality reasoning from images.

vision assistant, visual question answering, language generation, image-to-text, google

nvidia / nemoretriever-parse

Cutting-edge vision-language model excelling in retrieving text and metadata from images.

optical character recognition, nemo retriever, data ingestion, table extraction, supported language - english, nvidia

deepseek-ai / deepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding, distillation, reasoning, math, deepseek-ai

deepseek-ai / deepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding, distillation, reasoning, math, deepseek-ai

deepseek-ai / deepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding, distillation, reasoning, math, deepseek-ai

microsoft / phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory/compute-constrained environments

code generation, chat, text-to-text, language generation, microsoft

microsoft / phi-4-multimodal-instruct

Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.

speech recognition, visual qa, language generation, image-to-text, chart and table understanding, microsoft

mistralai / mistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

code, reasoning, agent-centric, multilingual, mistralai

deepseek-ai / deepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chat, math, advanced reasoning, deepseek-ai

nvidia / Build an Enterprise RAG pipeline

Connect AI applications to multimodal enterprise data with a scalable retrieval-augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.

nemo retriever, nim, blueprint, retrieval-augmented generation, nvidia ai, nvidia

nvidia / llama-3.1-nemoguard-8b-topic-control

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

dialogue safety, llm safety, guard model, content safety, nvidia

nvidia / llama-3.1-nemoguard-8b-content-safety

Leading content safety model for enhancing the safety and moderation capabilities of LLMs

llm safety, content moderation, guard model, content safety, nvidia

igenius / colosseum_355b_instruct_16k

Multilingual LLM trained on NVIDIA DGX Cloud, designed for mission-critical use cases in regulated industries, including financial services, government, and heavy industry

heavy industry, government, highly regulated use case support, financial services, igenius

tiiuae / falcon3-7b-instruct

Instruction-tuned LLM achieving SoTA performance on reasoning, math, and general knowledge

coding, code generation, language generation, improved reasoning, math, scientific knowledge, tiiuae

igenius / italia_10b_instruct_16k

Multilingual LLM with emphasis on European languages, supporting regulated use cases including financial services, government, and heavy industry

heavy industry, government, highly regulated use case support, financial services, igenius

nvidia / PDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

blueprint, multi-modal, text-to-speech, conversational ai, pdf-to-podcast, nvidia ai, ai agent, nvidia

langchain / Structured Report Generation

Generate detailed, structured reports on any topic using LangGraph and Llama 3.3 70B NIM

langgraph, report generation, ai agents, blueprint, partner, nvidia ai, langchain

qwen / qwen2.5-coder-7b-instruct

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

code completion, code generation, text-to-code, qwen

meta / sam2

SAM 2 is a segmentation model that enables fast, precise selection of any object in any video or image.

meta, computer vision, segmentation, video, meta

nvidia / llama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

nemo retriever, run on rtx, embedding, retrieval augmented generation, text-to-embedding, nvidia

nvidia / usdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

openusd, synthetic data generation, digital twin, code generation, chat, nvidia nim, nvidia

nvidia / nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detection, data ingestion, chart detection, nemo retriever, table detection, run on rtx, extraction, nvidia

baidu / paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

optical character recognition, table extraction, optical character detection, nemo retriever, run on rtx, data ingestion, extraction, baidu

nvidia / conformer-ctc-asr

Automatic speech recognition model that transcribes speech in lowercase English with record-setting accuracy and performance

asr, streaming, speech-to-text, spanish, nvidia nim, nvidia riva, nvidia

nvidia / fourcastnet

FourCastNet predicts global atmospheric dynamics of various weather / climate variables.

weather simulation, ai weather prediction, climate science, earth-2, nvidia

nvidia / Build a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

vision, video-to-text, generative ai, blueprint, chat, nvidia ai, nvidia

nvidia / Build an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry

customer service, blueprint, retrieval-augmented generation, llm, contact center, nvidia ai, nvidia

nvidia / nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.

indic, chat, text-to-text, language generation, nvidia

ibm / granite-3.0-3b-a800m-instruct

Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification

small language model, moe, language generation, text-to-text, ibm

nvidia / llama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM-generated responses.

code generation, chat, text-to-text, language generation, nvidia

institute-of-science-tokyo / llama-3.1-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai, large language model, chat, regional language generation, institute-of-science-tokyo

institute-of-science-tokyo / llama-3.1-swallow-8b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai, large language model, chat, regional language generation, institute-of-science-tokyo

nvidia / studiovoice

Enhance speech by correcting common audio degradations to create studio-quality speech output.

run on rtx, nvidia maxine, speech-to-speech, digital human, speech enhancement, nvidia

nvidia / mistral-nemo-minitron-8b-8k-instruct

State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.

small language model, code generation, chat, text-to-text, language generation, nvidia

meta / llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generation, chat, text-to-text, language generation, meta

meta / llama-3.2-11b-vision-instruct

Cutting-edge vision-language model excelling in high-quality reasoning from images.

image-text retrieval, visual qa, image-to-text, image captioning, visual grounding, meta

meta / llama-3.2-90b-vision-instruct

Cutting-edge vision-language model excelling in high-quality reasoning from images.

image-text retrieval, visual qa, image captioning, image-to-text, visual grounding, meta

meta / llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generation, chat, text-to-text, language generation, meta

nvidia / llama-3.1-nemotron-51b-instruct

Unique language model that delivers unmatched accuracy-efficiency performance.

language generation, chat, text-to-text, nvidia

deepmind / alphafold2-multimer

Predicts the 3D structure of a protein from its amino acid sequence.

nim, bionemo, biology, protein folding, drug discovery, deepmind

nvidia / consistory

Generates consistent characters across a series of images without requiring additional training.

image generation, text-to-image, nvidia

meta / esm2-650m

Generates embeddings of proteins from their amino acid sequences.

nim, protein embedding, bionemo, biology, drug discovery, meta

deepmind / alphafold2

Predicts the 3D structure of a protein from its amino acid sequence.

nim, bionemo, biology, protein folding, drug discovery, deepmind

yentinglin / llama-3-taiwan-70b-instruct

Sovereign AI model fine-tuned on Traditional Mandarin and English data using the Llama-3 architecture.

regional language generation, chat, code generation, large language models, yentinglin

tokyotech-llm / llama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language model, chat, regional language generation, tokyotech-llm

microsoft / phi-3.5-vision-instruct

Cutting-edge open multimodal model excelling in high-quality reasoning from images.

vision assistant, visual question answering, language generation, image-to-text, microsoft

ai21labs / jamba-1.5-mini-instruct

Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.

chat, language generation, text-to-text, ai21labs

ai21labs / jamba-1.5-large-instruct

Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.

chat, language generation, text-to-text, ai21labs

nvidia / nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

chat, text-to-text, language generation, nvidia

nvidia / mistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.

language generation, text-to-text, chat, small language model, nvidia

microsoft / phi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecture to deliver compute-efficient content generation

moe, code generation, chat, text-to-text, language generation, microsoft

microsoft / phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory/compute-constrained environments

code generation, chat, text-to-text, language generation, large language models, microsoft

rakuten / rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat, text-to-text, language generation, large language models, rakuten

rakuten / rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chat, text-to-text, language generation, large language models, rakuten

nvidia / nv-grounding-dino

Grounding DINO is an open-vocabulary zero-shot object detection model.

object detection, computer vision, deepstream, nvidia nim, nvidia

briaai / BRIA-2.3

An enterprise-grade text-to-image model, trained on a compliant dataset, that produces high-quality images.

image generation, text-to-image, briaai

nvidia / megatron-1b-nmt

Enable smooth global interactions in 36 languages.

text translation, neural machine translation, nvidia nim, nvidia

nvidia / parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

asr, streaming, english, batch, run on rtx, speech-to-text, fast, nvidia nim, nvidia

microsoft / florence-2

Vision foundation model capable of performing diverse computer vision and vision-language tasks.

image classification, image, object detection, cv, multimodal, vision assistant, vlm, visual question answering, computer vision, language generation, image-to-text, text-to-image, microsoft

nvidia / usdsearch

AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.

openusd, synthetic data generation, digital twin, usd, text-to-3d, nvidia nim, nvidia

GettyImages / edify-image

Getty Images' API service for 4K image generation. Trained on NVIDIA Edify using Getty Images' commercially safe creative libraries.

outpaint, image generation, replace, image modification, inpaint, gettyimages

nvidia / eyecontact

Estimate gaze angles of a person in a video and redirect the gaze to make it frontal.

telepresence, nvidia maxine, digital human, nvidia

nvidia / usdvalidate

Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.

validation, openusd, synthetic data generation, digital twin, usd, visualization 3d, nvidia

mistralai / mamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completion, code generation, mistralai

meta / llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

run on rtx, code generation, chat, text-to-text, language generation, meta

nv-mistralai / mistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

run on rtx, code generation, chat, language generation, text-to-text, nv-mistralai

microsoft / phi-3-medium-128k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

code generation, chat, text-to-text, language generation, large language models, microsoft

nvidia / llama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text, non-commercial use only, chat, nvidia

nvidia / llama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-text, non-commercial use only, chat, nvidia

01-ai / yi-large

Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.

code generation, chat, text-to-text, multilingual, 01-ai

nvidia / nvclip

NV-CLIP is a multimodal embeddings model for image and text.

computer vision, multimodal embeddings, text and image, run on rtx, nvidia nim, nvidia

writer / palmyra-med-70b-32k

Leading LLM for accurate, contextually relevant responses in the medical domain.

text-to-text, healthcare, writer

writer / palmyra-med-70b

Leading LLM for accurate, contextually relevant responses in the medical domain.

text-to-text, healthcare, writer

nvidia / nv-embed-v1

Generates high-quality numerical embeddings from text inputs.

non-commercial use only, retrieval augmented generation, text-to-embedding, nvidia

upstage / solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use only, chat, text-to-text, language generation, large language models, upstage

baai / bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

embeddings, retrieval augmented generation, text-to-embedding, baai

mediatek / breeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

chat, text-to-text, regional language generation, mediatek

nvidia / retail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

object detection, image, cv, vlm, computer vision, tao toolkit, video, nvidia nim, nvidia

ipd / rfdiffusion

A generative model of protein backbones for protein binder design.

biology, nim, bionemo, drug discovery, protein generation, ipd

microsoft / phi-3-small-8k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

code generation, chat, text-to-text, language generation, large language models, microsoft

microsoft / phi-3-small-128k-instruct

Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.

code generation, chat, text-to-text, language generation, large language models, microsoft

microsoft / phi-3-medium-4k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

code generation, chat, text-to-text, language generation, large language models, microsoft

microsoft / phi-3-vision-128k-instruct

Cutting-edge open multimodal model excelling in high-quality reasoning from images.

image, cv, vision assistant, vlm, visual question answering, computer vision, language generation, image-to-text, video, microsoft

aisingapore / sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

chat, text-to-text, regional language generation, large language models, aisingapore

microsoft / phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generation, chat, text-to-text, language generation, large language models, microsoft

databricks / dbrx-instruct

A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.

chat, text-to-text, language generation, large language models, databricks

microsoft / phi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generation, chat, text-to-text, language generation, large language models, microsoft

mistralai / mixtral-8x22b-instruct-v0.1

An MoE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning, code generation, chat, text-to-text, large language models, mistralai

meta / llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

code generation, chat, text-to-text, language generation, large language models, meta

google / codegemma-7b

Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

code generation, chat, language generation, text-to-code, google

google / gemma-2b

Lightweight language model deployable on a laptop, desktop, or the cloud for summarization and reasoning.

code generation, chat, text-to-text, language generation, google

nvidia / embed-qa-4

GPU-accelerated generation of text embeddings used for question-answering retrieval.

embeddings, retrieval augmented generation, text-to-embedding, nvidia

microsoft / kosmos-2

Groundbreaking multimodal model designed to understand and reason about visual elements in images.

image, cv, multimodal, vlm, visual question answering, computer vision, image understanding, image-to-text, video, microsoft

google / deplot

Translate images of plots into tables with one-shot visual language understanding.

nemo retriever, multimodal, data ingestion, image-to-text, google

nvidia / neva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

image, cv, vision assistant, non-commercial use only, vlm, visual question answering, computer vision, image-to-text, video, nvidia

adept / fuyu-8b

Multi-modal model for a wide range of tasks, including image understanding and language generation.

image, cv, multimodal, vlm, computer vision, image understanding, language generation, image-to-text, video, adept

nvidia / vista-3d

VISTA-3D is a specialized interactive foundation model for segmenting and annotating human anatomies.

interactive annotation, image segmentation, non-commercial use only, medical imaging, nvidia

nvidia / fq2bam

Generate BAM output given one or more pairs of FASTQ files, by running BWA-MEM & GATK best practices.

parabricks, genomics, dna sequencing, nvidia

nvidia / deepvariant

Run Google's DeepVariant optimized for GPU. Switch models for high accuracy on all major sequencers.

parabricks, genomics, dna sequencing, nvidia

stabilityai / sdxl-turbo

A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation

image generation, text-to-image, stabilityai

meta / esmfold

Predicts the 3D structure of a protein from its amino acid sequence.

biology, nim, bionemo, protein folding, drug discovery, meta

mit / diffdock

Predicts the 3D structure of how a molecule interacts with a protein.

chemistry, nim, bionemo, docking, drug discovery, mit

mistralai / mixtral-8x7b-instruct-v0.1

An MoE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning, code generation, chat, text-to-text, large language models, mistralai