Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA

Launch from Hugging Face (Beta)

Filters (2)

Free Endpoint

17

Partner Endpoint

10

Download Available

9

Use Case

Code Generation

15

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Object Detection

0

Inference Providers

Deep Infra

6

Together AI

5

CoreWeave

3

GMI Cloud

2

Bitdeer AI

1

Publisher

Microsoft

8

Meta

5

NVIDIA

3

Google

3

Qwen

2

API Catalog Type

Enterprise

0

Blueprint Type

NVIDIA BioNeMo

0

Labels (2)

language generation

chat

26 models

Free Endpoint

gemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

429K

9mo

Free Endpoint

gemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation

302K

9mo

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory/compute-constrained environments

351K

11mo

Deprecated, Downloadable

qwen2.5-7b-instruct

Chinese and English LLM targeting language, coding, mathematics, reasoning, and related tasks.

Chinese Language Generation

6.98M

11mo

Deprecated, Free Endpoint

nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.

361K

11mo

Downloadable

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

16.14K

993K

11mo

Downloadable

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

15.82K

361K

11mo

Deprecated, Free Endpoint

qwen2-7b-instruct

Chinese and English LLM targeting language, coding, mathematics, reasoning, and related tasks.

Chinese Language Generation

129K

11mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

197K

1y

Deprecated, Free Endpoint

mistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.

language generation

3.16K

1y

Deprecated, Free Endpoint

phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory/compute-constrained environments

1.19M

1y

Deprecated, Free Endpoint

rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

44.26K

11mo

Deprecated, Free Endpoint

rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

44.4K

11mo

Free Endpoint

gemma-2-2b-it

Advanced small language generative AI model for edge applications

492K

11mo

Downloadable

llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

3.49M

10mo

Downloadable

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

14.72M

9mo

Deprecated, Free Endpoint

phi-3-medium-128k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

93.79K

11mo

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

879K

10mo

Free Endpoint

solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Non-Commercial Use Only

161K

1y

Deprecated, Free Endpoint

phi-3-small-8k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

43.15K

11mo

Deprecated, Free Endpoint

phi-3-small-128k-instruct

Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.

59.48K

11mo

Deprecated, Free Endpoint

phi-3-medium-4k-instruct

Cutting-edge lightweight open language model excelling in high-quality reasoning.

54.5K

11mo

Deprecated, Free Endpoint

sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

1y

Deprecated, Downloadable

phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

76.3K

11mo
