Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

3

Partner Endpoint

2

Download Available

2

Use Case

Image-to-Text

1

Drug Discovery

0

Retrieval Augmented Generation

0

Speech-to-Text

0

Code Generation

0

Inference Providers

Deepinfra

2

OpenRouter

2

GMI Cloud

2

Together AI

1

Vultr

1

Publisher

Qwen

2

Google

1

NVIDIA

0

Meta

0

Mistral AI

0

NIM Container GPUs

B200

1

GB200

1

H100 80GB HBM3

0

L40S

0

H200

0

Labels (1)

image

3 models

Sort By

DownloadableFree Endpoint

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

Items per page

of 1 pages

10M

3mo

DownloadableFree Endpoint

qwen3.5-397b-a17b

Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.

13M

4mo

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

10K

1y