Explore

Models

Skills

Blueprints

GPUs

Docs

Your Privacy Choices

Contact

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image Generation

Text-to-Image

Drug Discovery

Image-to-Text

Retrieval Augmented Generation

Inference Providers

Deepinfra

OpenRouter

Together AI

GMI Cloud

Lightning AI

Publisher

Mistral AI

Qwen

Black forest labs

ByteDance

NVIDIA

NIM Container GPUs

H100 80GB HBM3

B200

L40S

H200

A100 SXM4 80GB

4 models

Sort By

Mistral AI

DownloadableFree Endpoint

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

code generation

Items per page

of 1 pages

13M

3mo

Qwen

DownloadableFree Endpoint

qwen3-next-80b-a3b-instruct

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

text-generation

32M

9mo

ByteDance

Free Endpoint

seed-oss-36b-instruct

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

thinking budget

9mo

Black-forest-labs

Downloadable

FLUX.1-Kontext-dev

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Text-to-Image

10mo