Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
14 models
Black-forest-labs
Downloadable
flux.2-klein-4b
FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lightning speed.
Text-to-Image
+2
1.99K
3d
Microsoft
Downloadable
TRELLIS
MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
text-to-3d
+2
4.81K
6mo
Black-forest-labs
Downloadable
FLUX.1-Kontext-dev
FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
Text-to-Image
+2
3.72K
7mo
Black-forest-labs
Downloadable
FLUX.1-schnell
FLUX.1-schnell is a distilled image generation model that produces high-quality images quickly.
Text-to-Image
+2
48.18K
9mo
Black-forest-labs
Downloadable
FLUX.1-dev
FLUX.1 is a state-of-the-art suite of image generation models
Text-to-Image
+2
99.9K
9mo
DeepSeek AI
Downloadable
deepseek-r1-distill-llama-8b
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distillation
+5
4.99M
8mo
NVIDIA
Downloadable
nemoretriever-page-elements-v2
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+6
163K
12mo
NVIDIA
Downloadable
nv-yolox-page-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Object Detection
+6
15.5K
8mo
Baidu
Downloadable
paddleocr
Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
Optical Character Recognition
+6
132K
8mo
NVIDIA
Downloadable
studiovoice
Enhances speech by correcting common audio degradations to create studio-quality speech output.
NVIDIA Maxine
+4
574
9mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
ASR
+7
8.37K
9mo
Meta
Downloadable
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
chat
+5
5.44M
8mo
NVIDIA
Downloadable
nv-embedqa-e5-v5
English text embedding model for question-answering retrieval.
Embedding
+4
2.97M
7mo
NVIDIA
Downloadable
nvclip
NV-CLIP is a multimodal embeddings model for image and text.
Computer vision
+3
17.88K
9mo