Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
20 results for
Filters
Models (16)
Blueprints (2)
Other (2)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
49.84K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
+7
8.58K
9mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
5.48K
11mo
NVIDIA
Downloadable
nemotron-asr-streaming
Real-time speech recognition for English
Model
Automatic Speech Recognition
+2
331
4d
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
+3
31.2K
10mo
NVIDIA
Nemotron Voice Agent
A voice agent that uses the Nemotron model to generate responses to voice commands.
Blueprint
Voice Agent
+4
2w
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
Model
ASR
+4
10
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
742
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
Model
ASR
+4
8.09K
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
+4
364
5mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
Model
ASR
+8
52.35K
11mo
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
+4
2.57K
7mo
NVIDIA
Launchable
Ambient Healthcare Agents
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM
Blueprint
NVIDIA AI
+3
4w
Speakleash
Free Endpoint
bielik-11b-v2.6-instruct
State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
Model
chat
+4
583K
6mo
NVIDIA
Downloadable
llama-3.2-nemoretriever-1b-vlm-embed-v1
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
259K
8mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
498K
8mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
1.21M
1mo
Baidu
Downloadable
paddleocr
Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
Model
Optical Character Recognition
+6
85.68K
8mo
DGX Spark
30 MIN
Vibe Coding in VS Code
Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue
Playbook
DGX
+2
5mo
DGX Spark
Vibe Coding in VS Code
Use DGX Spark as a local or remote Vibe Coding assistant with Ollama and Continue.dev
Playbook
DGX
+2
5mo
Items per page
24
1
1
of 1 pages