Try NVIDIA NIM APIs

24 results for

audio2face-3d

Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.

Model

Digital Humans

9mo

NVIDIA

Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Model

Automatic Speech Recognition

5.1K

11mo

NVIDIA

API Endpoint

gliner-pii

GLiNER PII detects Personally Identifiable Information in text.

Model

PII Detection

145K

NVIDIA

API Endpoint

magpie-tts-flow

Expressive and engaging text-to-speech, generated from a short audio sample.

Model

TTS

776

8mo

NVIDIA

Downloadable

magpie-tts-multilingual

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

Model

TTS

32.45K

8mo

NVIDIA

API Endpoint

magpie-tts-zeroshot

Expressive and engaging text-to-speech, generated from a short audio sample.

Model

TTS

1.26K

9mo

NVIDIA

Downloadable

maisi

MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.

Model

Image Generation

735

11mo

NVIDIA

Downloadable

megatron-1b-nmt

Enable smooth global interactions in 36 languages.

Model

Neural machine translation

11mo

Mistral AI

API Endpoint

mistral-7b-instruct-v0.2

This LLM follows instructions, completes requests, and generates creative text.

Model

chat

567K

9mo

NVIDIA

API Endpoint

nv-dinov2

NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.

Model

computer vision

1.18M

11mo

NVIDIA

API Endpoint

nv-grounding-dino

Grounding DINO is an open-vocabulary, zero-shot object detection model.

Model

Object Detection

3.6K

11mo

NVIDIA

Downloadable

parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages.

Model

Automatic Speech Recognition

36.22K

10mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

Model

ASR

8.48K

9mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-es

Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.

Model

ASR

6mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

Model

ASR

743

6mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin-English transcriptions.

Model

ASR

7.64K

6mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.

Model

ASR

419

4mo

NVIDIA

Downloadable

parakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

Model

ASR

45.51K

8mo

NVIDIA

Downloadable

parakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps.

Model

ASR

2.66K

7mo

NVIDIA

API Endpoint

retail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

Model

Object Detection

363

NVIDIA

Downloadable

riva-translate-1.6b

Enable smooth global interactions in 36 languages.

Model

Neural machine translation

446K

8mo

NVIDIA

API Endpoint

riva-translate-4b-instruct-v1_1

Translation model for 12 languages with support for few-shot example prompts.

Model

nvidia nim

567K

3mo

NVIDIA

API Endpoint

visual-changenet

Visual ChangeNet detects pixel-level changes between two images and outputs a semantic change segmentation mask.

Model

image

640

OpenAI

Downloadable

whisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

Model

ASR

54.31K

11mo
