NVIDIA
Explore Models Blueprints GPUs
Terms of Use

|

Privacy Policy

|

Manage My Privacy

|

Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: nvidia nim
Sorting by Most Recent

mistralaimistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generationimage-to-textmultimodalvisual question answeringmistralai

nvidiaparakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

asrstreamingspeech-to-textmultilingualnvidia nimnvidia

nvidia3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

blueprintrun on rtxnvidia ainvidia

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

image generationrun on rtxtext-to-imageblack-forest-labs

nvidiallama-3.1-nemotron-ultra-253b-v1

Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.

mathadvanced reasoninginstruction followingfunction callingnvidia

metallama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answeringmeta

metallama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generationimage-to-textvision assistantvisual question answeringmeta

qwenqwq-32b

Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.

codingchatmathadvanced reasoningqwen

nvidiaBuild an AI Agent for Research and Reporting

Create AI agents that reason, plan, reflect and refine to produce high-quality reports based on source materials of your choice.

nimllama nemotronreasoningblueprintretrieval-augmented generationnvidia ainemo retrievernvidia

nvidiaAI Weather Analytics with Earth-2

Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.

climate scienceblueprintweather simulationai weather predictionnvidia aiearth-2nvidia

nvidiaSingle Cell Analysis

Investigate, understand, and interpret single cell data in minutes, not days by leveraging RAPIDS-singlecell, powered by NVIDIA RAPIDS

rapidsrna sequencingblueprintgenomicssingle cellbiologynvidia ainvidia

nvidiaGenomics Analysis

Easily run essential genomics workflows to save time leveraging Parabricks

parabricksblueprintgenomicsbiologydna sequencingnvidia ainvidia

nvidiaSynthetic Manipulation Motion Generation for Robotics

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

nvidia omniverseblueprintsynthetic dataroboticsphysical airobot learninghumanoidsnvidia isaac gr00ttext-to-worldimage-to-worldteleopnvidia

nvidiacosmos-predict1-7b

Generates physics-aware video world states from text and image prompts for physical AI development.

synthetic data generationphysical airoboticstext-to-worldimage-to-worldnvidia

nvidiacosmos-predict1-5b

Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.

synthetic data generationphysical aipolicy evaluationroboticsvideo-to-worldnvidia

nvidiaTest Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

industrialnvidia omniverseblueprintsimulationomniverse blueprintnvidia

nvidiasparsedrive

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

autonomous vehiclesbevav stackautomotivenvidia

nvidiabevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

autonomous vehiclesbevautomotiveperceptionnvidia

nvidiallama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

mathadvanced reasoninginstruction followingfunction callingnvidia

nvidiallama-3.1-nemotron-nano-8b-v1

Leading reasoning and agentic AI accuracy model for PC and edge.

mathadvanced reasoninginstruction followingfunction callingnvidia

nvidiamagpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

ttstext-to-speechnvidia nimnvidia rivamultilingualnvidia

nvidianv-embedcode-7b-v1

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

nemo retrieverembeddingretrieval augmented generationnvidia

deepseek-aideepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillationcodingrun on rtxreasoningmathdeepseek-ai

nvidianemoretriever-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detectionchart detectionnemo retrievertable detectiondata ingestionnvidia

nvidianemoretriever-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detectionchart detectionnemo retrievertable detectiondata ingestionnvidia

nvidianemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detectionchart detectionnemo retrievertable detectiondata ingestionnvidia

colabfoldmsa-search

Generates a multiple sequence alignment from a query sequence and a protein sequence database search.

nimbionemobiologydrug discoveryprotein foldingcolabfold

openfoldopenfold2

Predicts the 3D structure of a protein from its amino acid sequence, multiple sequence alignments, and templates.

biologynimbionemodrug discoveryprotein foldingopenfold

googlegemma-3-27b-it

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textgoogle

googlegemma-3-1b-it

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

translationchattext-to-textlanguage generationgoogle

nvidianemoretriever-parse

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

optical character recognitionnemo retrieverdata ingestiontable extractionsupported language - englishnvidia

nvidiaLLM Router

Route LLM requests to the best model for the task at hand.

blueprintllm routernvidia ainvidia

deepseek-aideepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationreasoningmathdeepseek-ai

deepseek-aideepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationreasoningmathdeepseek-ai

deepseek-aideepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

codingdistillationreasoningmathdeepseek-ai

microsoftphi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationmicrosoft

microsoftphi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

speech recognitionvisual qalanguage generationimage-to-textchart and table understandingmicrosoft

nvidiaEvo 2 Protein Design

This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.

protein generationdrug discoveryblueprintnvidia

arcevo2-40b

Evo 2 is a biological foundation model that is able to integrate information over long genomic sequences while retaining sensitivity to single-nucleotide changes.

dna generationbiologynimbionemodrug discoveryarc

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

asrastspeech-to-textbatchwhisperopenaimultilingualnvidia nimnvidia rivaopenai

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asraststreamingspeech-to-textbatchspanishmultilingualnvidia nimnvidia rivanvidia

nvidiacanary-0.6b-turbo-asr

Multi-lingual model supporting speech-to-text recognition and translation.

asrastfastspeech-to-textbatchmultilingualnvidia nimnvidia rivanvidia

mistralaimistral-small-24b-instruct

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

codereasoningagent-centricmultilingualmistralai

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chatmathadvanced reasoningdeepseek-ai

nvidiaBuild an Enterprise RAG pipeline

Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.

nemo retrievernimblueprintretrieval-augmented generationnvidia ainvidia

nvidiallama-3.1-nemoguard-8b-topic-control

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

dialogue safetyllm safetyguard modelcontent safetynvidia

nvidianemoguard-jailbreak-detect

Industry leading jailbreak classification model for protection from adversarial attempts

llm securityjailbreak detectionprompt injectionnvidia nimnvidia

nvidiallama-3.1-nemoguard-8b-content-safety

Leading content safety model for enhancing the safety and moderation capabilities of LLMs

llm safetycontent moderationguard modelcontent safetynvidia

igeniuscolosseum_355b_instruct_16k

NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry

heavy industrygovernmenthighly regulated use case supportfinancial servicesigenius

tiiuaefalcon3-7b-instruct

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

codingcode generationlanguage generationimproved reasoningmathscientific knowledgetiiuae

igeniusitalia_10b_instruct_16k

Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry

heavy industrygovernmenthighly regulated use case supportfinancial servicesigenius

qwenqwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchattext-to-textlarge language modelsqwen

nvidiaBuild A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

nvidia bionemoblueprintbionemobiologydrug discoveryprotein generationnvidia

nvidiagenmol

Fragment-Based Molecular Generation by Discrete Diffusion.

chemistrynimbionemomolecule generationdrug discoverynvidia

nvidiaPDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

blueprintmulti-modaltext-to-speechconversational aipdf-to-podcastnvidia aiai agenttext-to-speechnvidia

pipecatVoice Agent Framework for Conversational AI

Automate voice AI agents with NVIDIA NIM microservices and Pipecat.

pipecatai agentsblueprintconversational aipartnernvidia aipipecat

llamaindexDocument Research Assistant for Blog Creation

Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.

blog creationai agentsblueprintpartnerllamaindexnvidia aillamaindex

langchainStructured Report Generation

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM

langgraphreport generationai agentsblueprintpartnernvidia ailangchain

crewaiCode Documentation for Software Development

Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.

code documentationcrewaiai agentsblueprintpartnernvidia aicrewai

nvidiacosmos-nemotron-34b

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlmvision language modelimage captionimage to textnvidia

qwenqwen2.5-coder-32b-instruct

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

code completioncode generationtext-to-codeqwen

qwenqwen2.5-coder-7b-instruct

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

code completioncode generationtext-to-codeqwen

metasam2

SAM 2 is a segmentation model that enables fast, precise selection of any object in any video or image.

metacomputer visionsegmentationvideometa

writerpalmyra-creative-122b

Powerful LLM designed for creative thinking and writing.

content generationchattext-to-textwriter

nvidiallama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

nemo retrieverrun on rtxembeddingretrieval augmented generationtext-to-embeddingnvidia

nvidiallama-3.2-nv-rerankqa-1b-v2

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

nemo retrieverretrieval augmented generationrerankingnvidia

nvidiausdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

openusdsynthetic data generationdigital twincode generationchatnvidia nimnvidia

metallama-3.3-70b-instruct

Advanced LLM for reasoning, math, general knowledge, and function calling

reasoningcode generationtext-to-textinstruction followingmathmeta

university-at-buffalocached

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

nemo retrieverchart element detectionimage-to-textuniversity-at-buffalo

nvidianv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

object detectiondata ingestionchart detectionnemo retrievertable detectionrun on rtxextractionnvidia

baidupaddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

optical character recognitiontable extractionoptical character detectionnemo retrieverrun on rtxdata ingestionextractionbaidu

nvidiaaudio2face-3d

Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.

speech-to-animationdigital humansaudio-to-facenvidia nimnvidia

nvidiaBuild a Digital Twin for Interactive Fluid Simulation

This NVIDIA Omniverseâ„¢ Blueprint demonstrates how commercial software vendors can create interactive digital twins.

nvidia omniverseblueprintcaesimulationexternal aerodynamicscomputer-aided-engineeringnvidia

nvidiaconformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case English with record-setting accuracy and performance

asrstreamingspeech-to-textspanishnvidia nimnvidia rivanvidia

nvidiacorrdiff

Generative downscaling model for generating high resolution regional scale weather fields.

ai weather predictionweather simulationearth-2nvidia

nvidiafourcastnet

FourCastNet predicts global atmospheric dynamics of various weather / climate variables.

weather simulationai weather predictionclimate scienceearth-2nvidia

hivedeepfake-image-detection

Advanced AI model detects faces and identifies deep fake images.

computer visionai safetydeep fake detectioncontent moderationhive

nvidiaBuild a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

visionvideo-to-textgenerative aiblueprintchatnvidia ainvidia

nvidia3D Conditioning for Precise Visual Generative AI

Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.

visual designnvidia omniverseblueprintsimulationnvidia

nvidiaBuild an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry

customer serviceblueprintretrieval-augmented generationllmcontact centernvidia ainvidia

nvidianemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

indicchattext-to-textlanguage generationnvidia

ibmgranite-guardian-3.0-8b

Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

guardrailtext-to-textibm

ibmgranite-3.0-8b-instruct

Advanced Small Language Model supporting RAG, summarization, classification, code, and agentic AI

small language modelchattext-to-textibm

ibmgranite-3.0-3b-a800m-instruct

Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification

small language modelmoelanguage generationtext-to-textibm

nvidiallama-3.1-nemotron-70b-instruct

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

code generationchattext-to-textlanguage generationnvidia

zyphrazamba2-7b-instruct

Efficient hybrid state-space model designed for conversational and reasoning tasks.

chatlanguage generationtext-to-textzyphra

nvidiaVulnerability Analysis for Container Security

Rapidly identify and mitigate container security vulnerabilities with generative AI.

generative ainv-embedqa-e5-v5blueprintllama-3_1-70b-instructcybersecuritynvidia ainvidia

institute-of-science-tokyollama-3.1-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ailarge language modelchatregional language generationinstitute-of-science-tokyo

institute-of-science-tokyollama-3.1-swallow-8b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ailarge language modelchatregional language generationinstitute-of-science-tokyo

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

run on rtxnvidia maxinespeech-to-speechdigital humanspeech enhancementnvidia

nvidiamistral-nemo-minitron-8b-8k-instruct

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

small language modelcode generationchattext-to-textlanguage generationnvidia

nvidiallama-3.1-nemotron-70b-reward

Leaderboard topping reward model supporting RLHF for better alignment with human preferences.

text-to-textreward modelrlhfnvidia

metallama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationmeta

metallama-3.2-11b-vision-instruct

Cutting-edge vision-language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage-to-textimage captioningvisual groundingmeta

metallama-3.2-90b-vision-instruct

Cutting-edge vision-Language model exceling in high-quality reasoning from images.

image-text retrievalvisual qaimage captioningimage-to-textvisual groundingmeta

metallama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationmeta

nvidiallama-3.1-nemotron-51b-instruct

Unique language model that delivers an unmatched accuracy-efficiency performance.

language generationchattext-to-textnvidia

qwenqwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

chinese language generationchattext-to-textlarge language modelsqwen

abacusaidracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

code generationtext-to-textabacusai

deepmindalphafold2-multimer

Predicts the 3D structure of a protein from its amino acid sequence.

nimbionemobiologyprotein foldingdrug discoverydeepmind

nvidiaconsistory

Generates consistent characters across a series of images without requiring additional training.

image generationtext-to-imagenvidia

nvidiavila

Multi-modal vision-language model that understands text/img/video and creates informative responses

vlmvision language modelimage captionimage to textnvidia

hiveai-generated-image-detection

Robust image classification model for detecting and managing AI-generated content.

image classificationcomputer visionai safetycontent moderationhive

metaesm2-650m

Generates embeddings of proteins from their amino acid sequences.

nimprotein embeddingbionemobiologydrug discoverymeta

deepmindalphafold2

Predicts the 3D structure of a protein from its amino acid sequence.

nimbionemobiologyprotein foldingdrug discoverydeepmind

nvidiaBuild a Digital Human

Create intelligent, interactive avatars for customer service across industries

digital humansspeech-to-textnvidia omniverseblueprintchataudio-to-facenvidia ainvidia

yentinglinllama-3-taiwan-70b-instruct

Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.

regional language generationchatcode generationlarge language modelsyentinglin

tokyotech-llmllama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language modelchatregional language generationtokyotech-llm

nvidiaBuild A Generative Virtual Screening Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.

chemistrynimnvidia bionemoblueprintbionemodockingdrug discoverynvidia

microsoftphi-3.5-vision-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

vision assistantvisual question answeringlanguage generationimage-to-textmicrosoft

ai21labsjamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatlanguage generationtext-to-textai21labs

ai21labsjamba-1.5-large-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

chatlanguage generationtext-to-textai21labs

nvidianemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

chattext-to-textlanguage generationnvidia

nvidiamistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

language generationtext-to-textchatsmall language modelnvidia

microsoftphi-3.5-moe-instruct

Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation

moecode generationchattext-to-textlanguage generationmicrosoft

microsoftphi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

nvidianv-dinov2

NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.

image-to-embeddingcomputer visiondeepstreamnvidia nimobject classificationnvidia

rakutenrakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chattext-to-textlanguage generationlarge language modelsrakuten

rakutenrakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chattext-to-textlanguage generationlarge language modelsrakuten

nvidianv-grounding-dino

Grounding dino is an open vocabulary zero-shot object detection model.

object detectioncomputer visiondeepstreamnvidia nimnvidia

briaaiBRIA-2.3

An enterprise-grade text-to-image model trained on a compliant dataset produces high quality images.

image generationtext-to-imagebriaai

nvidiaradtts-hifigan-tts

Natural, high-fidelity, English voices for personalizing text-to-speech services and voiceovers

text-to-speechtext-to-speechnvidia nimnvidia

nvidiamegatron-1b-nmt

Enable smooth global interactions in 36 languages.

text translationneural machine translationnvidia nimnvidia

nvidiafastpitch-hifigan-tts

Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots

text-to-speechnvidia nimnvidia

nvidiaparakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

asrstreamingenglishspeech-to-textbatchnvidia nimnvidia

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

asrstreamingenglishbatchrun on rtxspeech-to-textfastnvidia nimnvidia

ipdproteinmpnn

ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.

biologynimbionemodrug discoveryprotein generationipd

microsoftflorence-2

Vision foundation model capable of performing diverse computer vision and vision language tasks.

image classificationimageobject detectioncvmultimodalvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-texttext-to-imagemicrosoft

writerpalmyra-fin-70b-32k

Specialized LLM for financial analysis, reporting, and data processing

financetext-to-textwriter

googleshieldgemma-9b

Guardrail model to ensure that responses from LLMs are appropriate and safe

guardrailtext-to-textgoogle

googlegemma-2-2b-it

Advanced small language generative AI model for edge applications

code generationchattext-to-textlanguage generationgoogle

nvidiausdsearch

AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.

openusdsynthetic data generationdigital twinusdtext-to-3dnvidia nimnvidia

GettyImagesedify-image

Getty Images’ API service for 4K image generation. Trained on NVIDIA Edify using Getty Images' commercially safe creative libraries.

outpaintimage generationreplaceimage modificationinpaintgettyimages

nvidiaeyecontact

Estimate gaze angles of a person in a video and redirect to make it frontal.

telepresencenvidia maxinedigital humannvidia

nvidiaaudio2face-2d

Create facial animations using a portrait photo and synchronize mouth movement with audio.

speech-to-animationtelepresencenvidia maxinedigital humannvidia

nvidiausdvalidate

Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.

validationopenusdsynthetic data generationdigital twinusdvisualization 3dnvidia

thudmchatglm3-6b

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

chattext-to-textregional language generationthudm

mistralaimamba-codestral-7b-v0.1

Model for writing and interacting with code across a wide range of programming languages and tasks.

code completioncode generationcode generationmistralai

baichuan-incbaichuan2-13b-chat

Support Chinese and English chat, coding, math, instruction following, solving quizzes

chinese language generationtext translationchattext-to-textbaichuan-inc

metallama-3.1-405b-instruct

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

synthetic data generationchatcode generationmeta

metallama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

code generationchattext-to-textlanguage generationmeta

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

run on rtxcode generationchattext-to-textlanguage generationmeta

nv-mistralaimistral-nemo-12b-instruct

Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.

run on rtxcode generationchatlanguage generationtext-to-textnv-mistralai

nvidianv-rerankqa-mistral-4b-v3

Multilingual text reranking model.

nemo retrieverrerankingretrieval augmented generationnvidia

nvidianv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

embeddingretrieval augmented generationnemo retrievertext-to-embeddingnvidia

nvidianv-embedqa-mistral-7b-v2

Multilingual text question-answering retrieval, transforming textual information into dense vector representations.

nemo retrieverembeddingretrieval augmented generationnvidia

nvidiamaisi

MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.

image generationmedical imagingnvidia nimnvidia

microsoftphi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

bigcodestarcoder2-7b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

bigcodestarcoder2-15b

Advanced programming model for code completion, summarization, and generation

code completioncode generationcode generationbigcode

googlegemma-2-27b-it

Cutting-edge text generation model text understanding, transformation, and code generation.

code generationchattext-to-textlanguage generationgoogle

nvidiallama3-chatqa-1.5-70b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textnon-commercial use onlychatnvidia

nvidiallama3-chatqa-1.5-8b

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

text-to-textnon-commercial use onlychatnvidia

01-aiyi-large

Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.

code generationchattext-to-textmultilingual01-ai

mistralaimistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

chattext-to-textlanguage generationmistralai

nvidianvclip

NV-CLIP is a multimodal embeddings model for image and text.

computer visionmultimodal embeddingstext and imagerun on rtxnvidia nimnvidia

stabilityaistable-diffusion-3-medium

Advanced text-to-image model for generating high quality images

image generationtext-to-imagestabilityai

nvidiaocdrnet

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

optical character recognitionimageoptical character detectioncvvlmcomputer visiontao toolkitvideonvidia

writerpalmyra-med-70b-32k

Leading LLM for accurate, contextually relevant responses in the medical domain.

text-to-texthealthcarewriter

writerpalmyra-med-70b

Leading LLM for accurate, contextually relevant responses in the medical domain.

text-to-texthealthcarewriter

nvidianv-embed-v1

Generates high-quality numerical embeddings from text inputs.

non-commercial use onlyretrieval augmented generationtext-to-embeddingnvidia

upstagesolar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

non-commercial use onlychattext-to-textlanguage generationlarge language modelsupstage

baaibge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

embeddingsretrieval augmented generationtext-to-embeddingbaai

mediatekbreeze-7b-instruct

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

chattext-to-textregional language generationmediatek

nvidiavisual-changenet

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

imageimage generationcvimage segmentationvlmcomputer visiontao toolkitvideonvidia nimnvidia

googlecodegemma-1.1-7b

Advanced programming model for code generation, completion, reasoning, and instruction following.

code generationcode completiongoogle

ibmgranite-34b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

code generationchatlarge language modelstext-to-codeibm

ibmgranite-8b-code-instruct

Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

code generationchatlarge language modelstext-to-codeibm

nvidiaretail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

object detectionimagecvvlmcomputer visiontao toolkitvideonvidia nimnvidia

ipdrfdiffusion

A generative model of protein backbones for protein binder design.

biologynimbionemodrug discoveryprotein generationipd

microsoftphi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

microsoftphi-3-vision-128k-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from images.

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideomicrosoft

googlepaligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

imagecvvision assistantvlmvisual question answeringcomputer visionlanguage generationimage-to-textvideogoogle

aisingaporesea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

chattext-to-textregional language generationlarge language modelsaisingapore

microsoftphi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

databricksdbrx-instruct

A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.

chattext-to-textlanguage generationlarge language modelsdatabricks

snowflakearctic-embed-l

Optimized community model for text embedding.

nemo retrieverembeddingretrieval augmented generationtext-to-embeddingsnowflake

microsoftphi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

code generationchattext-to-textlanguage generationlarge language modelsmicrosoft

mistralaimixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningcode generationchattext-to-textlarge language modelsmistralai

metallama3-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

large language modelscode generationchattext-to-textlanguage generationmeta

metallama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

code generationchattext-to-textlanguage generationlarge language modelsmeta

googlerecurrentgemma-2b

Novel recurrent architecture based language model for faster inference when generating long sequences.

code generationchattext-to-textlanguage generationgoogle

googlecodegemma-7b

Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.

code generationchatlanguage generationtext-to-codegoogle

googlegemma-2b

Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.

code generationchattext-to-textlanguage generationgoogle

nvidiaembed-qa-4

GPU-accelerated generation of text embeddings used for question-answering retrieval.

embeddingsretrieval augmented generationtext-to-embeddingnvidia

nvidiarerank-qa-mistral-4b

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

rankingretrieval augmented generationnvidia

stabilityaistable-diffusion-xl

Generate images and stunning visuals with realistic aesthetics.

image generationtext-to-imagestabilityai

microsoftkosmos-2

Groundbreaking multimodal model designed to understand and reason about visual elements in images.

imagecvmultimodalvlmvisual question answeringcomputer visionimage understandingimage-to-textvideomicrosoft

googledeplot

Translate images of plots into tables with one-shot visual language understanding.

nemo retrievermultimodaldata ingestionimage-to-textgoogle

nvidianeva-22b

Multi-modal vision-language model that understands text/images and generates informative responses

imagecvvision assistantnon-commercial use onlyvlmvisual question answeringcomputer visionimage-to-textvideonvidia

adeptfuyu-8b

Multi-modal model for a wide range of tasks, including image understanding and language generation.

imagecvmultimodalvlmcomputer visionimage understandinglanguage generationimage-to-textvideoadept

nvidiavista-3d

VISTA-3D is a specialized interactive foundation model for segmenting and anotating human anatomies.

interactive annotationimage segmentationnon-commercial use onlymedical imagingnvidia

googlegemma-7b

Cutting-edge text generation model text understanding, transformation, and code generation.

code generationchattext-to-textlanguage generationgoogle

mistralaimistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

text-to-textlanguage generationnvidia nimmistralai

nvidiafq2bam

Generate BAM output given one or more pairs of FASTQ files, by running BWA-MEM & GATK best practices.

parabricksgenomicsdna sequencingnvidia

nvidiadeepvariant

Run Google's DeepVariant optimized for GPU. Switch models for high accuracy on all major sequencers.

parabricksgenomicsdna sequencingnvidia

stabilityaistable-video-diffusion

Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.

image generationtext-to-imagestabilityai

stabilityaisdxl-turbo

A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation

image generationtext-to-imagestabilityai

nvidiamolmim

MolMIM performs controlled generation, finding molecules with the right properties.

chemistrynimbionemomolecule generationdrug discoverynvidia

metaesmfold

Predicts the 3D structure of a protein from its amino acid sequence.

biologynimbionemoprotein foldingdrug discoverymeta

mitdiffdock

Predicts the 3D structure of how a molecule interacts with a protein.

chemistrynimbionemodockingdrug discoverymit

mistralaimixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoningcode generationchattext-to-textlarge language modelsmistralai

nvidiacuopt

World-record accuracy and performance for complex route optimization.

route optimizationnvidia