Try NVIDIA NIM APIs

Downloadable

conformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance

Items per page

of 1 pages

Downloadable

magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

TTS

36.79K

10mo

Free Endpoint

magpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

TTS

1.94K

10mo

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Automatic Speech Recognition

23.59K

1mo

Downloadable

parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

Automatic Speech Recognition

3.14K

12mo

Free Endpoint

gliner-pii

GLiNER PII detects Personally Identifiable Information in text.

PII Detection

118K

1mo

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

English

5.14K

1mo

Downloadable

megatron-1b-nmt

Enable smooth global interactions in 36 languages.

Neural machine translation

Downloadable

parakeet-ctc-0.6b-es

Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.

7mo

Downloadable

parakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

171

7mo

Downloadable

parakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin English transcriptions.

3.07K

7mo

Downloadable

parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.

158

6mo

Downloadable

parakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

66.24K

10mo

Downloadable

riva-translate-1.6b

Enable smooth global interactions in 36 languages.

Neural machine translation

28.92K

10mo

Free Endpoint

riva-translate-4b-instruct-v1_1

Translation model in 12 languages with few-shots example prompts capability.

nvidia nim

209K

4mo

Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech Recognition

3.7K

Downloadable

parakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

1.71K

8mo

Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

2.38K

10mo

OpenAI

Downloadable

whisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

71.09K

Downloadable

genmol

Fragment-Based Molecular Generation by Discrete Diffusion.

Chemistry

6.75K

9mo

Downloadable

molmim

MolMIM performs controlled generation, finding molecules with the right properties.

Chemistry

197K

9mo

Downloadable

nemoguard-jailbreak-detect

Industry leading jailbreak classification model for protection from adversarial attempts