⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

0

Partner Endpoint

6

Download Available

13

Use Case

Image Generation

4

Text-to-Image

4

Code Generation

1

Retrieval Augmented Generation

1

Object Detection

1

Inference Providers

Deep Infra

4

Together AI

2

GMI Cloud

2

CoreWeave

1

Bitdeer AI

0

Publisher

NVIDIA

5

Black forest labs

4

Meta

1

Microsoft

1

DeepSeek AI

1

API Catalog Type

Enterprise

0

Blueprint Type

NVIDIA BioNemo

0

Labels (1)

Run-on-RTX

13 models

Sort By

Black-forest-labs

Downloadable

flux.2-klein-4b

FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lighting speed

97.95K

1mo

Downloadable

TRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

3.81K

7mo

Black-forest-labs

Downloadable

FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

3.15K

8mo

Black-forest-labs

Downloadable

FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

82.27K

10mo

Black-forest-labs

Downloadable

FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

57.23K

10mo

Downloadable

deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

1.46M

9mo

Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

70.4K

1y

Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

2.2K

9mo

Downloadable

paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character Recognition

277K

9mo

Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

2.66K

10mo

Downloadable

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

12.66M

9mo

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

12.37M

8mo

Downloadable

nvclip

NV-CLIP is a multimodal embeddings model for image and text.

Computer vision

83.93K

10mo

Items per page

of 1 pages