Try NVIDIA NIM APIs

Explore

Models

Skills

Blueprints

9 results for

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image Generation

Text-to-Image

Image-to-Text

Optical Character Recognition

Synthetic Data Generation

Inference Providers

Deepinfra

GMI Cloud

OpenRouter

Together AI

Bitdeer

Publisher

Qwen

NVIDIA

Google

Microsoft

Mistral AI

NIM Container GPUs

B200

GB200

Sort By

Qwen

Downloadable

qwen-image

Qwen-Image is a text-to-image foundation model with advanced multilingual text rendering.

Model

Text-to-Image

1mo

Items per page

of 1 pages

Mistral AI

DownloadableFree Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

Model

code generation

13M

3mo

Qwen

DownloadableFree Endpoint

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

Model

tool calling

10M

3mo

NVIDIA

Free Endpoint

cosmos3-nano

Generates physics-aware videos from text prompts or an image prompt for physical AI development.

Model

autonomous vehicles

25d

Microsoft

Downloadable

TRELLIS

MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.

Model

text-to-3d

9mo

Qwen

DownloadableFree Endpoint

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

Model

MoE

13M

4mo

Google

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

Model

image

10K

Stability AI

Downloadable

stable-diffusion-3.5-large

Stable Diffusion 3.5 is a popular text-to-image generation model

Model

Text-to-Image

10mo

NVIDIA

Downloadable

nemotron-ocr-v2

Nemotron OCR v2 is a state-of-the-art multilingual text recognition model designed for robust end-to-end optical character recognition (OCR) on complex real-world images.

Model

Table Extraction

151