Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
27 results for
Filters
Models (27)
Blueprints (13)
Other (0)
Sort By
score:DESC
Best Match
NVIDIA
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
833
8mo
NVIDIA
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Model
TTS
+4
33.92K
8mo
NVIDIA
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
1.23K
9mo
NVIDIA
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
+3
35.82K
10mo
NVIDIA
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
Model
PII Detection
+1
76.04K
6d
NVIDIA
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Model
Image Generation
+2
742
11mo
NVIDIA
audio2face-3d
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Model
Speech-to-Animation
+3
8mo
NVIDIA
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Model
Text Translation
+2
11mo
NVIDIA
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
Model
Image-to-Embedding
+4
1.07M
11mo
NVIDIA
nv-grounding-dino
Grounding dino is an open vocabulary zero-shot object detection model.
Model
Object Detection
+3
3.51K
11mo
NVIDIA
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
Model
ASR
+4
6mo
NVIDIA
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
712
6mo
NVIDIA
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
Model
ASR
+4
7.71K
6mo
NVIDIA
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
+4
383
4mo
NVIDIA
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
34.17K
8mo
NVIDIA
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Model
Text Translation
+2
632K
8mo
NVIDIA
riva-translate-4b-instruct-v1_1
Translation model in 12 languages with few-shots example prompts capability.
Model
nvidia nim
+2
499K
2mo
NVIDIA
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
3.39K
11mo
NVIDIA
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
+4
2.98K
7mo
NVIDIA
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
+7
8.55K
8mo
NVIDIA
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Model
Object Detection
+7
803
1y
NVIDIA
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
Model
image
+8
624
1y
OpenAI
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
Model
ASR
+8
49.51K
11mo
NVIDIA
genmol
Fragment-Based Molecular Generation by Discrete Diffusion.
Model
Chemistry
+4
2.52K
7mo
Items per page
24
1
1
2
2
of 2 pages