Try NVIDIA NIM APIs

⌘KCtrl+K

Your Privacy Choices

Contact

Explore

⌘KCtrl+K

34 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Launchable

Developer Example

Enterprise Blueprint

Use Case

Speech-to-Text

Code Generation

Optical Character Recognition

Image-to-Text

Digital Twin

Inference Providers

Deep Infra

Together AI

Bitdeer AI

GMI Cloud

Publisher

NVIDIA

Meta

Cyborg

DeepSeek AI

Google

Blueprint Type

NVIDIA AI

NVIDIA Omniverse

Sort By

OpenAI

Downloadable

whisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

Model

ASR

Items per page

of 2 pages

77.68K

NVIDIA

Downloadable

conformer-ctc-asr

Automatic speech recognition model that transcribes speech in lower case Spanish with record-setting accuracy and performance

Model

ASR

NVIDIA

Downloadable

parakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

Model

ASR

71.47K

11mo

NVIDIA

Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Model

Automatic Speech Recognition

28.35K

NVIDIA

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Model

Automatic Speech Recognition

16.6K

2mo

NVIDIA

Downloadable

parakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

Model

Automatic Speech Recognition

26.6K

General

Developer Example

Nemotron Voice Agent

Build Real-Time Voice Agents with NVIDIA Nemotron NIM.

Blueprint

Voice Agent

2mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-es

Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.

Model

ASR

1.39K

8mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

Model

ASR

130

8mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin English transcriptions.

Model

ASR

7.28K

8mo

NVIDIA

Downloadable

parakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.

Model

ASR

1.57K

7mo

DeepSeek AI

Downloadable

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.

Model

coding

12.88M

1mo

NVIDIA

Downloadable

parakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

Model

ASR

49.29K

10mo

Healthcare & Life Sciences

LaunchableDeveloper Example

Ambient Healthcare Agents

Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM

Blueprint

NVIDIA AI

3mo

General

LaunchableEnterprise

Build an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Blueprint

NVIDIA AI

3mo

Cyborg

Deprecation in 25dLaunchable

Cyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

Blueprint

NIM

3mo

Meta

Downloadable

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Model

chat

39.54K409K

Meta

Downloadable

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Model

Chat

23.31K1.34M

Meta

Free Endpoint

llama-guard-4-12b

Multi-modal model to classify safety for input prompts as well output responses.

Model

LLM Multimodal Safety

138K

11mo

NVIDIA

Downloadable

llama-nemotron-embed-vl-1b-v2

Multimodal question-answer retrieval representing user queries as text and documents as images.

Model

nemo retriever

7.15M

3mo

General

LaunchableDeveloper Example

LLM Router

Route LLM requests to the best model for the task at hand.

Blueprint

NVIDIA AI

3mo

DGX Station

30 MINS

Local Coding Agent

Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)

Playbook

Coding

2mo

Mistral AI

Free Endpoint

mistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

Model

language generation

4.08M

5mo

NVIDIA

Downloadable

nemoretriever-ocr

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Model

Table Extraction

14.25K

10mo