⌘KCtrl+K

Your Privacy Choices

Contact

Explore

Models

⌘KCtrl+K

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

Partner Endpoint

Download Available

Use Case

Image-to-Text

Code Generation

Retrieval Augmented Generation

Drug Discovery

Speech-to-Text

Inference Providers

Together AI

Deep Infra

Bitdeer AI

CoreWeave

GMI Cloud

Publisher

glm-4.7

GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.

Tool Calling

Items per page

of 1 pages

5.5M

Sarvamai

Downloadable

sarvam-m

Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.

coding

144K

9mo

Mistral AI

Deprecation in 12dFree Endpoint

magistral-small-2506

High performance reasoning model optimized for efficiency and edge deployment

coding

1.17M

9mo

llama-4-maverick-17b-128e-instruct

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

language generation

12.54M

9mo

Microsoft

Downloadable

phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Chat

532K

11mo