Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

3

Partner Endpoint

2

Download Available

2

Use Case

Code Generation

3

Drug Discovery

0

Image-to-Text

0

Retrieval Augmented Generation

0

Speech-to-Text

0

Inference Providers

Deepinfra

2

OpenRouter

2

GMI Cloud

0

Bitdeer

0

Fireworks AI

0

Publisher

Meta

2

Google

1

NVIDIA

0

Mistral AI

0

Qwen

0

NIM Container GPUs

B200

2

H100 80GB HBM3

2

H200

2

L40S

2

A100 SXM4 80GB

2

3 models

Sort By

DownloadableFree Endpoint

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Language Generation

Items per page

of 1 pages

28.93K1.22M

1y

DownloadableFree Endpoint

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Language Generation

44.31K290K

1y

Free Endpoint

gemma-2-2b-it

Advanced small language generative AI model for edge applications

1.42M

1y