⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters (1)

Free Endpoint

1

Partner Endpoint

0

Download Available

0

Use Case

Code Generation

0

Retrieval Augmented Generation

0

Drug Discovery

0

Image-to-Text

0

Object Detection

0

Inference Providers

Deep Infra

0

Together AI

0

Bitdeer AI

0

GMI Cloud

0

CoreWeave

0

Publisher

Microsoft

1

NVIDIA

0

Mistral AI

0

Meta

0

Qwen

0

Labels (1)

edge

1 model

Sort By

DeprecatedFree Endpoint

phi-4-mini-flash-reasoning

Lightweight reasoning model for applications in latency bound, memory/compute constrained environments

9mo

Items per page

of 1 pages

139K