Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
14 results for
Filters
Models (14)
Blueprints (2)
Other (0)
Sort By
score:DESC
Best Match
Microsoft
Free Endpoint
phi-4-multimodal-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
Model
Speech Recognition
+4
Items per page
24
1
1
of 1 pages
450K
11mo
Mistral AI
Deprecation in 8d
Free Endpoint
mistral-medium-3-instruct
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
Model
language generation
+3
834K
9mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
Model
language generation
+3
1.83M
5mo
Mistral AI
Free Endpoint
mistral-large-3-675b-instruct-2512
A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
Model
language generation
+3
3.92M
5mo
Moonshotai
Downloadable
kimi-k2.6
1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.
Model
Multimodal
+3
102K
2d
NVIDIA
Downloadable
nvclip
NV-CLIP is a multimodal embeddings model for image and text.
Model
Computer vision
+3
39.62K
10mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
189K
10mo
NVIDIA
Free Endpoint
nemotron-3-content-safety
Multilingual, multimodal model for detecting unsafe and toxic content.
Model
llm safety
+3
35.6K
2w
Black-forest-labs
Downloadable
FLUX.1-Kontext-dev
FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
Model
Text-to-Image
+2
2.94K
8mo
Google
Deprecation in 9d
Free Endpoint
gemma-3-27b-it
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Model
Vision Assistant
+3
4.07M
11mo
Meta
Free Endpoint
llama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
Model
language generation
+3
15.53M
9mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
6.95M
2mo
Mistral AI
Downloadable
mistral-small-4-119b-2603
Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
Model
code generation
+2
9.25M
1mo
Qwen
Downloadable
qwen3.5-122b-a10b
122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
Model
tool calling
+3
7.97M
1mo