Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Optimized by NVIDIA
Launch from Hugging Face
Beta
Filters
3 models
Sort By
dateCreated:DESC
Most Recent
Google
Downloadable
gemma-3-1b-it
A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
chat
+4
4.34K
554K
9mo
NVIDIA
Downloadable
canary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.
Automatic Speech Recognition
+3
5.1K
11mo
THUDM
Free Endpoint
chatglm3-6b
Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
chat
+5
618K
8mo
Items per page
24
1
1
of 1 pages