Try NVIDIA NIM APIs

Copyright © 2026 NVIDIA Corporation

4 results for

Filters

Free Endpoint: 3
Download Available: 1
Use Case: Image-to-Text (4)
Publisher: Google (1), Meta (1), Mistral AI (1), NVIDIA (1)

Deprecated, Free Endpoint

mistral-medium-3-instruct

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

language generation

63.59K

10mo

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses.

15.84K

1y

Downloadable

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generation

2.89M

6mo

Free Endpoint

llama-4-maverick-17b-128e-instruct

A general-purpose multimodal, multilingual 128-expert MoE model with 17B active parameters.

language generation

26.82M

10mo