⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

15

Partner Endpoint

9

Download Available

9

Use Case

Code Generation

15

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Object Detection

0

Inference Providers

Deep Infra

6

Together AI

5

CoreWeave

3

GMI Cloud

2

Bitdeer AI

0

Publisher

Microsoft

8

Meta

5

NVIDIA

3

Qwen

2

Rakuten

2

Labels (2)

chat

language generation

24 models

Sort By

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Items per page

of 1 pages

374K

11mo

DeprecatedDownloadable

qwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Chinese Language Generation

5.17M

11mo

DeprecatedFree Endpoint

nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

334K

11mo

Downloadable

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

12.93K923K

11mo

Downloadable

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

17.08K383K

11mo

DeprecatedFree Endpoint

qwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Chinese Language Generation

115K

11mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

214K

1y

DeprecatedFree Endpoint

mistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

language generation

2.71K

1y

DeprecatedFree Endpoint

phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

830K

1y

DeprecatedFree Endpoint

rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

37.83K

11mo

DeprecatedFree Endpoint

rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

37.8K

11mo

Free Endpoint

gemma-2-2b-it

Advanced small language generative AI model for edge applications

515K

11mo

Downloadable

llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

3.07M

10mo

Downloadable

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

15.93M

9mo

DeprecatedFree Endpoint

phi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

85.28K

11mo

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

911K

10mo

Free Endpoint

solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Non-Commercial Use Only

176K

1y

DeprecatedFree Endpoint

phi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

35.89K

11mo

DeprecatedFree Endpoint

phi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

52.01K

11mo

DeprecatedFree Endpoint

phi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

47.36K

11mo

DeprecatedFree Endpoint

sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

1y

DeprecatedDownloadable

phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

68.06K

11mo

DeprecatedFree Endpoint

phi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

91.01K

11mo

DeprecatedDownloadable

llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

599K

11mo