Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
24 results for
Filters
Models (24)
Blueprints (0)
Other (0)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
audio2face-3d
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Model
Digital Humans
+3
9mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Model
Automatic Speech Recognition
+3
5.1K
11mo
NVIDIA
API Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
Model
PII Detection
+1
145K
1w
NVIDIA
API Endpoint
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
776
8mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Model
TTS
+4
32.45K
8mo
NVIDIA
API Endpoint
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
Model
TTS
+3
1.26K
9mo
NVIDIA
Downloadable
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Model
Image Generation
+2
735
11mo
NVIDIA
Downloadable
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Model
Neural machine translation
+2
11mo
Mistral AI
API Endpoint
mistral-7b-instruct-v0.2
This LLM follows instructions, completes requests, and generates creative text.
Model
chat
+3
567K
9mo
NVIDIA
API Endpoint
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
Model
computer vision
+4
1.18M
11mo
NVIDIA
API Endpoint
nv-grounding-dino
Grounding dino is an open vocabulary zero-shot object detection model.
Model
Object Detection
+4
3.6K
11mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Model
Automatic Speech Recognition
+3
36.22K
10mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
Model
ASR
+7
8.48K
9mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
Model
ASR
+4
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
Model
ASR
+4
743
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
Model
ASR
+4
7.64K
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
Model
ASR
+4
419
4mo
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
Model
ASR
+5
45.51K
8mo
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
Model
ASR
+4
2.66K
7mo
NVIDIA
API Endpoint
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Model
Object Detection
+8
363
1y
NVIDIA
Downloadable
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Model
Neural machine translation
+2
446K
8mo
NVIDIA
API Endpoint
riva-translate-4b-instruct-v1_1
Translation model in 12 languages with few-shots example prompts capability.
Model
nvidia nim
+2
567K
3mo
NVIDIA
API Endpoint
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
Model
image
+8
640
1y
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
Model
ASR
+8
54.31K
11mo
Items per page
24
1
1
of 1 pages