⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (2)

Free Endpoint

17

Partner Endpoint

10

Download Available

9

Use Case

Code Generation

16

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Object Detection

0

Inference Providers

Deep Infra

7

Together AI

5

CoreWeave

3

GMI Cloud

2

Bitdeer AI

0

Publisher

Microsoft

8

Meta

5

NVIDIA

3

Mistral AI

2

Qwen

2

API Catalog Type

Enterprise

0

Blueprint Type

NVIDIA BioNemo

0

Labels (2)

Text-to-text

language generation

26 models

Sort By

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

275K

10mo

DeprecatedDownloadable

qwen2.5-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Chinese Language Generation

7.42M

10mo

DeprecatedFree Endpoint

nemotron-4-mini-hindi-4b-instruct

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

387K

10mo

Downloadable

llama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

21.34K893K

10mo

Downloadable

llama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

15.76K236K

10mo

DeprecatedFree Endpoint

qwen2-7b-instruct

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Chinese Language Generation

154K

10mo

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

139K

1y

DeprecatedFree Endpoint

mistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

language generation

3.73K

1y

DeprecatedFree Endpoint

phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

2.36M

1y

DeprecatedFree Endpoint

rakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

46.16K

10mo

DeprecatedFree Endpoint

rakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

46.73K

10mo

Free Endpoint

gemma-2-2b-it

Advanced small language generative AI model for edge applications

265K

10mo

Downloadable

llama-3.1-70b-instruct

Powers complex conversations with superior contextual understanding, reasoning and text generation.

3.75M

10mo

Downloadable

llama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

12.66M

9mo

DeprecatedFree Endpoint

phi-3-medium-128k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

97K

10mo

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

721K

10mo

Free Endpoint

solar-10.7b-instruct

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Non-Commercial Use Only

89.61K

1y

DeprecatedFree Endpoint

phi-3-small-8k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

45.5K

10mo

DeprecatedFree Endpoint

phi-3-small-128k-instruct

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

61.97K

10mo

DeprecatedFree Endpoint

phi-3-medium-4k-instruct

Cutting-edge lightweight open language model exceling in high-quality reasoning.

56.67K

10mo

Deprecation in 1dFree Endpoint

sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

1y

DeprecatedDownloadable

phi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

80.07K

10mo

DeprecatedFree Endpoint

phi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

108K

10mo

DeprecatedDownloadable

llama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

784K

10mo

Items per page

of 2 pages