⌘KCtrl+K

Terms of Use

Privacy Policy

Your Privacy Choices

Contact

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

2

Partner Endpoint

2

Download Available

0

Use Case

Code Generation

0

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Object Detection

0

Inference Providers

Together AI

2

Deep Infra

1

Bitdeer AI

0

GMI Cloud

0

CoreWeave

0

Publisher

Mistral AI

1

Qwen

1

NVIDIA

0

magistral-small-2506

High performance reasoning model optimized for efficiency and edge deployment

coding

1.25M

9mo

Qwen

DeprecatedFree Endpoint

qwq-32b

Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.

coding

989K

9mo

Items per page

of 1 pages