NVIDIA
Explore
Models
Blueprints
GPUs
Docs

Deploy Models Now with NVIDIA NIM

Optimized inference for the world’s leading models
Free serverless APIs for development
Accelerated by DGX Cloud
Self-Host on your GPU infrastructure
Continuous vulnerability fixes
DiscoverModelsBlueprintsGPUsDocsForums

workstations

  • Run on RTX
  • Run on Spark

models

  • Reasoning
  • Vision
  • Visual Design
  • Retrieval
  • Speech
  • Biology
  • Simulation
  • Climate & Weather
  • Safety & Moderation

industries

  • Automotive
  • Financial Services
  • Gaming
  • Healthcare
  • Industrial
  • Robotics

Run on RTX

Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Developer Favorites

Run AI models on PCs and Workstations powered by RTX GPUs.

Run Anywhere

black-forest-labsFLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

Run-on-RTXImage GenerationText-to-Image
Run Anywhere

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

ChatLanguage GenerationRun-on-RTXText-to-TextCode Generation

nvidia3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

BlueprintRun-on-RTXNVIDIA AI
Run Anywhere

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Run-on-RTXImage GenerationText-to-Image

Visual Design

The latest innovations in multimedia generation models.

Run Anywhere

microsoftTRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

Run-on-RTXimage-to-3dtext-to-3d
Run Anywhere

black-forest-labsFLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

Run-on-RTXImage GenerationText-to-Image
Run Anywhere

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Run-on-RTXImage GenerationText-to-Image

nvidia3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

BlueprintRun-on-RTXNVIDIA AI

Vision

Multimodal models that can reason against image and video inputs and perform descriptive language generation.

Run Anywhere

nvidianvclip

NV-CLIP is a multimodal embeddings model for image and text.

Computer visionRun-on-rtxmultimodal embeddingstext and image

Speech

Run AI models on PCs and Workstations powered by RTX GPUs.

Run Anywhere

nvidiastudiovoice

Enhance speech by correcting common audio degradations to create studio quality speech output.

Digital HumanNvidia MaxineRun-on-RTXSpeech EnhancementSpeech-to-speech
Run Anywhere

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

ASRBatchEnglishFastNVIDIA NIMRun-on-RTXStreamingSpeech-to-Text

Explore NVIDIA RTX Blueprints

Get started with workflows and code samples to build AI applications on RTX from the ground up.

nvidia3D Object Generation

Transform your scene idea into ready-to-use 3D assets using Llama 3.1 8B, NV SANA, and Microsoft TRELLIS

BlueprintRun-on-RTXNVIDIA AI

nvidia3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

BlueprintRun-on-RTXNVIDIA AI

Retrieval

Accelerate large-scale extraction from massive collections of multimodal data—text, images, and complex documents—for rapid, context-aware insights across your enterprise.

Run Anywhere

baidupaddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character DetectionOptical Character RecognitionTable Extractiondata ingestionextractionnemo retrieverrun-on-rtx
Run Anywhere

nvidianv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Chart DetectionData ingestionObject DetectionTable Detectionextractionnemo retrieverrun-on-rtx
Run Anywhere

nvidiallama-3.2-nv-embedqa-1b-v2

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

embeddingnemo retrieverrun-on-rtxRetrieval Augmented GenerationText-to-Embedding

Reasoning

The latest innovations in intelligence models.

Run Anywhere

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

ChatLanguage GenerationRun-on-RTXText-to-TextCode Generation
Run Anywhere

deepseek-aideepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

Distillationcodingmathreasoningrun-on-rtx