Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA | Launch from Hugging Face (Beta)

Labels (2): reasoning, thinking budget

2 models

Free Endpoint

seed-oss-36b-instruct

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

thinking budget

7mo

1.08M

Downloadable

nvidia-nemotron-nano-9b-v2

High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.

thinking budget

377K

8mo