Try NVIDIA NIM APIs

DownloadableFree Endpoint

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

thinking budget

Items per page

of 4 pages

988K

9mo

Downloadable

cuopt

World-record accuracy and performance for complex route optimization.

Route Optimization

83.3K

Downloadable

conformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance

Downloadable

magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

TTS

117K

11mo

Free Endpoint

magpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

TTS

19.38K

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Automatic Speech Recognition

9.31K

3mo

Downloadable

parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

Automatic Speech Recognition

18.81K

Downloadable

eyecontact

Estimate gaze angles of a person in a video and redirect to make it frontal.

telepresence

1.62K

Free Endpoint

gliner-pii

GLiNER PII detects Personally Identifiable Information in text.

PII Detection

243K

3mo

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

English

1.77K

2mo

Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech Recognition

52.12K

Downloadable

parakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

146K

10mo

Downloadable

megatron-1b-nmt

Enable smooth global interactions in 36 languages.

Neural machine translation

Downloadable

parakeet-ctc-0.6b-es

Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.

1.42K

9mo

Downloadable

parakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

123

9mo

Downloadable

parakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin English transcriptions.

12.66K

9mo

Downloadable

parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.

1.18K

8mo

Downloadable

parakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

59.38K

11mo

Downloadable

Relighting

Re-illuminate people in video to match target lighting from a 360 HDRI environment map.

HDRI

227

1mo

Downloadable

riva-translate-1.6b

Enable smooth global interactions in 36 languages.

Neural machine translation

42.06K

11mo

Free Endpoint

riva-translate-4b-instruct-v1_1

Translation model in 12 languages with few-shots example prompts capability.

nvidia nim

282K

6mo

DownloadableFree Endpoint

Active Speaker Detection

Detect and track speaker identities across video frames.

broadcast

473

1mo

Downloadable

LipSync

Generative lip dubbing that syncs lips in a video to input audio.

broadcast

1mo

DownloadableFree Endpoint

Studio Voice

Enhance input speech recorded with low-quality microphones in noisy or reverberant environments, producing studio-quality speech.