⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

1

Partner Endpoint

2

Download Available

2

Use Case

Code Generation

1

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Speech-to-Text

0

Inference Providers

Deep Infra

1

Together AI

1

CoreWeave

1

Bitdeer AI

0

GMI Cloud

0

Publisher

NVIDIA

1

Mistral AI

1

Microsoft

1

Meta

0

Qwen

0

GPU Types

A100 SXM4 80GB

0

B200

0

GB200

0

GH200 144G HBM3e

0

H100 80GB HBM3

0

Labels (2)

language generation

Chat

3 models

Sort By

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Items per page

of 1 pages

579K

11mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

456K

1y

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

512K

10mo