Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
7 results for
Filters (1)
Models (7)
Blueprints (1)
Other (2)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Model
video understanding
+8
Items per page
24
1
1
of 1 pages
315K
4mo
Google
Deprecation in 11d
Free Endpoint
gemma-3-27b-it
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Model
Vision Assistant
+3
4.07M
11mo
NVIDIA
Downloadable
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Model
Quantum
+3
149K
2w
Meta
Free Endpoint
llama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
Model
language generation
+3
15.53M
9mo
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
language generation
+3
5.11M
6mo
NVIDIA
Downloadable
nvclip
NV-CLIP is a multimodal embeddings model for image and text.
Model
Computer vision
+3
39.62K
10mo
Google
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
22.13K
1y