⌘KCtrl+K

Your Privacy Choices

Contact

Explore

Models

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image-to-Text

Code Generation

Retrieval Augmented Generation

Drug Discovery

Speech-to-Text

Inference Providers

Deep Infra

Together AI

Bitdeer AI

CoreWeave

GMI Cloud

Publisher

Mistral AI

NVIDIA

Google

Microsoft

mistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

language generation

Items per page

of 1 pages

3.9M

5mo

Mistral AI

Downloadable

ministral-14b-instruct-2512

A general purpose VLM ideal for chat and instruction based use cases

language generation

1.62M

5mo

Microsoft

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Chat

532K

11mo

NVIDIA

Free Endpoint

nemotron-mini-4b-instruct

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

Chat

291K

Mistral AI

Downloadable

mistral-7b-instruct-v0.3

This LLM follows instructions, completes requests, and generates creative text.

Chat

551K

10mo

Google

Free Endpoint

paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image

30.26K