Try NVIDIA NIM APIs

Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

4 results for

Filters

Free Endpoint

2

Partner Endpoint

3

Download Available

3

Use Case

Speech-to-Text

1

Inference Providers

Deepinfra

3

OpenRouter

3

Together AI

3

CoreWeave

2

Digital Ocean

2

Publisher

OpenAI

3

NVIDIA

1

Sort By

Downloadable

whisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

Items per page

of 1 pages

161K

1y

DownloadableFree Endpoint

gpt-oss-120b

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.

45M

11mo

DownloadableFree Endpoint

gpt-oss-20b

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

18M

11mo

30 MIN

Run models with llama.cpp on DGX Spark

Build llama.cpp with CUDA and serve models via an OpenAI-compatible API

3mo