Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
5 models
NVIDIA
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text and images and creates informative responses
doc intelligence
+3
7.51M
8mo
NVIDIA
cosmos-nemotron-34b
Multi-modal vision-language model that understands text, images, and video and creates informative responses
VLM
+3
6
1y
Meta
llama-3.2-11b-vision-instruct
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Image-Text Retrieval
+5
676K
9mo
Meta
llama-3.2-90b-vision-instruct
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Image-Text Retrieval
+5
579K
9mo
Google
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
image
+8
330K
1y