⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

4

Partner Endpoint

3

Download Available

1

Use Case

Image-to-Text

3

Code Generation

0

Drug Discovery

0

Retrieval Augmented Generation

0

Object Detection

0

Inference Providers

Fireworks AI

3

Bitdeer AI

2

Deep Infra

1

Together AI

1

GMI Cloud

1

Publisher

Meta

2

Mistral AI

1

AI21 Labs

1

NVIDIA

0

Microsoft

0

API Catalog Type

Enterprise

0

Blueprint Type

NVIDIA BioNemo

0

Labels (1)

language generation

4 models

Sort By

Free Endpoint

mistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

7.73M

3mo

Free Endpoint

llama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

4.91M

8mo

DownloadableFree Endpoint

llama-4-scout-17b-16e-instruct

A multimodal, multilingual 16 MoE model with 17B parameters.

language generation

25.06K

8mo

Free Endpoint

jamba-1.5-mini-instruct

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

582K

10mo

Items per page

of 1 pages