Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
24 models
Sort By
dateCreated:DESC
Most Recent
NVIDIA
API Endpoint
gliner-pii
GLiNER PII detects Personally Identifiable Information in text.
PII Detection
+1
145K
1w
NVIDIA
API Endpoint
riva-translate-4b-instruct-v1_1
Translation model in 12 languages with few-shots example prompts capability.
nvidia nim
+2
567K
3mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-tw
Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
ASR
+4
419
4mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-zh-cn
Record-setting accuracy and performance for Mandarin English transcriptions.
ASR
+4
7.64K
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-es
Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.
ASR
+4
6mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-vi
Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.
ASR
+4
743
6mo
NVIDIA
Downloadable
parakeet-tdt-0.6b-v2
Accurate and optimized English transcriptions with punctuation and word timestamps
ASR
+4
2.66K
7mo
NVIDIA
API Endpoint
magpie-tts-flow
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
776
8mo
NVIDIA
Downloadable
riva-translate-1.6b
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
446K
8mo
NVIDIA
API Endpoint
magpie-tts-zeroshot
Expressive and engaging text-to-speech, generated from a short audio sample.
TTS
+3
1.26K
9mo
NVIDIA
Downloadable
parakeet-1.1b-rnnt-multilingual-asr
High accuracy and optimized performance for transcription in 25 languages
Automatic Speech Recognition
+3
36.22K
10mo
NVIDIA
Downloadable
magpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
TTS
+4
32.45K
8mo
OpenAI
Downloadable
whisper-large-v3
Robust Speech Recognition via Large-Scale Weak Supervision.
ASR
+8
54.31K
11mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Automatic Speech Recognition
+3
5.1K
11mo
NVIDIA
Downloadable
audio2face-3d
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Digital Humans
+3
9mo
NVIDIA
API Endpoint
nv-dinov2
NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
computer vision
+4
1.18M
11mo
NVIDIA
API Endpoint
nv-grounding-dino
Grounding dino is an open vocabulary zero-shot object detection model.
Object Detection
+4
3.6K
11mo
NVIDIA
Downloadable
megatron-1b-nmt
Enable smooth global interactions in 36 languages.
Neural machine translation
+2
11mo
NVIDIA
Downloadable
parakeet-ctc-1.1b-asr
Record-setting accuracy and performance for English transcription.
ASR
+5
45.51K
8mo
NVIDIA
Downloadable
parakeet-ctc-0.6b-asr
State-of-the-art accuracy and speed for English transcriptions.
ASR
+7
8.48K
9mo
NVIDIA
Downloadable
maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
Image Generation
+2
735
11mo
NVIDIA
API Endpoint
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
image
+8
640
1y
NVIDIA
API Endpoint
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Object Detection
+8
363
1y
Mistral AI
API Endpoint
mistral-7b-instruct-v0.2
This LLM follows instructions, completes requests, and generates creative text.
chat
+3
567K
9mo
Items per page
24
1
1
of 1 pages