Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Microsoft
Deprecated
Free Endpoint
phi-4-mini-flash-reasoning
Lightweight reasoning model for applications in latency-bound, memory- and compute-constrained environments
edge
156K
9mo