Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
17 results for
Filters
Models (13)
Blueprints (2)
Other (2)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
conformer-ctc-asr
Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance
Model
ASR
+5
34
1y
Items per page
24
1
1
of 1 pages
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
71.47K
11mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
28.35K
1y
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Model
Automatic Speech Recognition
+2
16.6K
2mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
+3
26.6K
1y
General
Developer Example
Nemotron Voice Agent
Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
Blueprint
Voice Agent
+4
2mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
Model
ASR
+4
1.39K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
130
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
Model
ASR
+4
7.28K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
+4
1.57K
7mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
Model
ASR
+8
77.68K
1y
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
+4
49.29K
10mo
Healthcare & Life Sciences
Launchable
Developer Example
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
3mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
138K
11mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
7.15M
3mo
DGX Spark
30 MIN
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
Playbook
DGX Spark
+3
1mo
DGX Spark
30 MIN
Vibe Coding in VS Code
Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
Playbook
DGX
+2
7mo