Skip to main content

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

3

Partner Endpoint

2

Download Available

3

Use Case

Code Generation

1

Drug Discovery

0

Image-to-Text

0

Retrieval Augmented Generation

0

Speech-to-Text

0

Inference Providers

Deepinfra

2

OpenRouter

1

Together AI

1

GMI Cloud

0

CoreWeave

0

Publisher

NVIDIA

2

Mistral AI

1

Meta

0

Google

0

Qwen

0

NIM Container GPUs

H100 80GB HBM3

3

B200

3

H200

3

L40S

3

A100 SXM4 80GB

3

Labels (2)

reasoning

Advanced Reasoning

3 models

Sort By

DownloadableFree Endpoint

llama-3.3-nemotron-super-49b-v1.5

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

advanced reasoning

Items per page

of 1 pages

3M

11mo

DownloadableFree Endpoint

llama-3.3-nemotron-super-49b-v1

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

advanced reasoning

5M

11mo

DownloadableFree Endpoint

mixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

Advanced Reasoning

996K

11mo