Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
26 models
Sorted by: Most Recent
NVIDIA
Free Endpoint
nemotron-voicechat
Nemotron 3 Voicechat
English
+2
2.64K
1w
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Automatic Speech Recognition
+2
1.26K
1w
NVIDIA
Free Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
161K
3w
NVIDIA
Free Endpoint
riva-translate-4b-instruct-v1_1
Translation model for 12 languages with support for few-shot example prompts.
nvidia nim
+2
602K
3mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Taiwanese Mandarin-English transcriptions.
ASR
+4
366
5mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin-English transcriptions.
ASR
+4
5.94K
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
ASR
+4
38
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
ASR
+4
799
6mo
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
ASR
+4
2.04K
7mo
NVIDIA
Free Endpoint
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
767
8mo
NVIDIA
Downloadable
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
26.63K
9mo
NVIDIA
Free Endpoint
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
1.29K
9mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Automatic Speech Recognition
+3
15.63K
10mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
TTS
+4
37.09K
9mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
ASR
+8
73.86K
11mo
NVIDIA
Downloadable
canary-1b-asr
Multilingual model supporting speech-to-text recognition and translation.
Automatic Speech Recognition
+3
6.31K
11mo
NVIDIA
Downloadable
audio2face-3d
Converts streamed audio to facial blendshapes for real-time lip-syncing and facial performances.
Digital Humans
+3
9mo
NVIDIA
Free Endpoint
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
computer vision
+4
1.21M
1y
NVIDIA
Free Endpoint
nv-grounding-dino
Grounding DINO is an open-vocabulary, zero-shot object detection model.
Object Detection
+4
3.85K
12mo
NVIDIA
Downloadable
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
11mo
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
ASR
+5
57.31K
9mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
ASR
+7
8.88K
9mo
NVIDIA
Downloadable
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Image Generation
+2
893
1y
NVIDIA
Free Endpoint
visual-changenet
Visual ChangeNet detects pixel-level changes between two images and outputs a semantic change segmentation mask.
image
+8
777
1y
Page 1 of 2 (24 items per page)