40 results for
Filters
Models (27)
Blueprints (13)
Other (0)
NVIDIA
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
833
8mo
NVIDIA
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Model
TTS
+4
33.92K
8mo
NVIDIA
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
1.23K
8mo
NVIDIA
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages.
Model
Automatic Speech Recognition
+3
35.82K
10mo
NVIDIA
Multi-LLM NIM
Use the multi-LLM compatible NIM container to deploy a broad range of LLMs from Hugging Face.
Blueprint
nim
2w
NVIDIA
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
Model
PII Detection
+1
76.04K
5d
NVIDIA
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Model
Image Generation
+2
742
11mo
NVIDIA
Launchable
Retail Catalog Enrichment
A GenAI system that enhances and localizes product catalogs with rich text content and imagery.
Blueprint
nim
+2
2w
NVIDIA
canary-1b-asr
Multilingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
3.39K
11mo
NVIDIA
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps.
Model
ASR
+4
2.98K
7mo
NVIDIA
audio2face-3d
Converts streamed audio to facial blendshapes for real-time lip-syncing and facial performances.
Model
Speech-to-Animation
+3
8mo
NVIDIA
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Model
Text Translation
+2
11mo
NVIDIA
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
Model
Image-to-Embedding
+4
1.07M
11mo
NVIDIA
nv-grounding-dino
Grounding DINO is an open-vocabulary, zero-shot object detection model.
Model
Object Detection
+3
3.51K
11mo
NVIDIA
parakeet-ctc-0.6b-es
Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
6mo
NVIDIA
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
712
6mo
NVIDIA
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin-English transcriptions.
Model
ASR
+4
7.71K
6mo
NVIDIA
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Taiwanese Mandarin-English transcriptions.
Model
ASR
+4
383
4mo
NVIDIA
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
34.17K
8mo
NVIDIA
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Model
Text Translation
+2
632K
8mo
NVIDIA
riva-translate-4b-instruct-v1_1
Translation model for 12 languages with support for few-shot example prompts.
Model
nvidia nim
+2
499K
2mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM.
Blueprint
nim
+4
2w
NVIDIA
Launchable
AI Agent for Telecom Network Configuration Planning
Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
Blueprint
nim
+4
2w
NVIDIA
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
+7
8.55K
8mo