Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters (1)
26 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
Free Endpoint
nemotron-voicechat
Nemotron 3 Voicechat
English
+2
6.36K
3w
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Automatic Speech Recognition
+2
18.79K
3w
NVIDIA
Free Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
51.21K
1mo
NVIDIA
Free Endpoint
riva-translate-4b-instruct-v1_1
Translation model in 12 languages with few-shots example prompts capability.
nvidia nim
+2
132K
3mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
ASR
+4
118
5mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
ASR
+4
3.87K
7mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
ASR
+4
79
7mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
ASR
+4
211
7mo
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
ASR
+4
1.47K
8mo
NVIDIA
Deprecation in 4d
Free Endpoint
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
1.11K
9mo
NVIDIA
Downloadable
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
28.07K
9mo
NVIDIA
Free Endpoint
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
1.38K
10mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Automatic Speech Recognition
+3
2.8K
11mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
TTS
+4
35.97K
9mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
ASR
+8
69.83K
1y
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Automatic Speech Recognition
+3
3.85K
1y
NVIDIA
Downloadable
audio2face-3d
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Digital Humans
+3
10mo
NVIDIA
Deprecation in 4d
Free Endpoint
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
computer vision
+4
935K
1y
NVIDIA
Deprecation in 4d
Free Endpoint
nv-grounding-dino
Grounding dino is an open vocabulary zero-shot object detection model.
Object Detection
+4
6.81K
1y
NVIDIA
Downloadable
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
2
1y
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
ASR
+5
57.63K
9mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
ASR
+7
2.47K
10mo
NVIDIA
Downloadable
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Image Generation
+2
1.01K
1y
NVIDIA
Deprecation in 4d
Free Endpoint
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
image
+8
622
1y
Items per page
24
1
1
2
2
of 2 pages