Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Black forest labs

flux.2-klein-4b

FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lightning speed.

image editing

271K

4mo

Microsoft

Downloadable

TRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

text-to-3d

10mo

Black-forest-labs

Downloadable

FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Text-to-Image

11mo

Black-forest-labs

Downloadable

FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model that produces high-quality images quickly.

Text-to-Image

253K

Black-forest-labs

Downloadable

FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Text-to-Image

246K

NVIDIA

Downloadable

nemoretriever-page-elements-v2

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

243K

NVIDIA

Downloadable

nv-yolox-page-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

191

Baidu

Downloadable

paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character Recognition

201K

NVIDIA

Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

ASR

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

Chat

19M

NVIDIA

Downloadable

nv-embedqa-e5-v5

English text embedding model for question-answering retrieval.

Embedding

16M

11mo