Try NVIDIA NIM APIs

Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

3 results for

Filters (1)

Free Endpoint

3

Download Available

1

Use Case

Image-to-Text

3

Publisher

Google

1

Meta

1

NVIDIA

1

Labels (1)

language generation

Sort By

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

Items per page

of 1 pages

10.22K

1y

Free Endpoint

llama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generation

20.32M

11mo

DownloadableFree Endpoint

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generation

2.47M

7mo