Try NVIDIA NIM APIs

Skip to main content

⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

3 results for

Filters (2)

Free Endpoint

3

Partner Endpoint

2

Download Available

0

Use Case

Image-to-Text

1

Image Generation

0

Text-to-Image

0

Synthetic Data Generation

0

Optical Character Recognition

0

Inference Providers

Deep Infra

1

Bitdeer AI

1

Together AI

1

GMI Cloud

0

Vultr

0

Publisher

Google

2

Microsoft

1

NVIDIA

0

Black forest labs

0

Qwen

0

NIM Container GPUs

B200

0

H100 80GB HBM3

0

H100 NVL

0

H200

0

L40S

0

Labels (2)

language generation

Speech Recognition

Sort By

Free Endpoint

phi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Speech Recognition

Items per page

of 1 pages

269K

1y

Free Endpoint

gemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

43.86M

10mo

Free Endpoint

gemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

3.74M

10mo