⌘KCtrl+K

Your Privacy Choices

Contact

Explore

Models

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image Generation

Text-to-Image

Image-to-Text

Drug Discovery

Code Generation

Inference Providers

Deep Infra

Together AI

Bitdeer AI

GMI Cloud

CoreWeave

Publisher

Black forest labs

Google

NVIDIA

Mistral AI

Microsoft

NIM Container GPUs

H100 80GB HBM3

L40S

B200

A100 SXM4 80GB

H200

11 models

Sort By

Mistral AI

DownloadableFree Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

code generation

Items per page

of 1 pages

16.19M

2mo

Black-forest-labs

Downloadable

flux.2-klein-4b

FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lighting speed

image editing

270K

2mo

NVIDIA

DownloadableFree Endpoint

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

language generation

2.44M

7mo

Stability AI

Downloadable

stable-diffusion-3.5-large

Stable Diffusion 3.5 is a popular text-to-image generation model

Text-to-Image

9mo

Black-forest-labs

Downloadable

FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Text-to-Image

3.63K

9mo

Google

Free Endpoint

gemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

3.74M

10mo

Google

Free Endpoint

gemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

43.86M

10mo

Black-forest-labs

Downloadable

FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

Text-to-Image

265K

11mo

Black-forest-labs

Downloadable

FLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Text-to-Image

257K

11mo

Microsoft

Free Endpoint

phi-4-multimodal-instruct

Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.

Speech Recognition

269K

Google

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image

10.29K