Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
19 results for
Filters
Models (15)
Blueprints (2)
Other (2)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
conformer-ctc-asr
Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance
Model
ASR
+5
1y
Items per page
24
1
1
of 1 pages
42
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
79.22K
10mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
+7
3.14K
11mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
4.34K
1y
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Model
Automatic Speech Recognition
+2
21.04K
2mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
+3
25.59K
1y
NVIDIA
Nemotron Voice Agent
Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
Blueprint
Voice Agent
+4
2mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
Model
ASR
+4
1.38K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
127
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
Model
ASR
+4
4.17K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
+4
676
7mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
Model
ASR
+8
61.99K
1y
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
+4
23.63K
9mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
2mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
219K
10mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
6.63M
3mo
Baidu
Downloadable
paddleocr
Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
Model
Optical Character Recognition
+6
2.83M
10mo
DGX Spark
30 MIN
Run models with llama.cpp on DGX Spark
Build llama.cpp with CUDA and serve models via an OpenAI-compatible API (Nemotron 3 Nano Omni as example)
Playbook
DGX Spark
+3
1mo
DGX Spark
30 MIN
Vibe Coding in VS Code
Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
Playbook
DGX
+2
7mo