⌘KCtrl+K

Your Privacy Choices

Contact

Explore

Models

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

Partner Endpoint

Download Available

Use Case

Retrieval Augmented Generation

Image-to-Text

Text-to-Embedding

Code Generation

Drug Discovery

Inference Providers

Deep Infra

Bitdeer AI

Together AI

Fireworks AI

GMI Cloud

Publisher

NVIDIA

nemotron-nano-12b-v2-vl

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

chat

784K

5mo

llama-guard-4-12b

Multi-modal model to classify safety for input prompts as well output responses.

LLM Multimodal Safety

167K

9mo

NVIDIA

Downloadable

llama-3.1-nemotron-nano-vl-8b-v1

Multi-modal vision-language model that understands text/img and creates informative responses

chat

9.84M

9mo

NVIDIA

Free Endpoint

bevformer

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

autonomous vehicles

117

8mo

NVIDIA

Downloadable

canary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech Recognition

5.99K

11mo

Abacus.AI

Free Endpoint

dracarys-llama-3.1-70b-instruct

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

chat

330K

10mo

BAAI

Downloadable

bge-m3

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Embeddings

7.52M

11mo

Items per page

of 1 pages