Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
57 results for
Filters
Models (57)
Blueprints (7)
Other (9)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
llama-3.2-nemoretriever-1b-vlm-embed-v1
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
271K
8mo
Qwen
Downloadable
qwen3.5-397b-a17b
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
Model
chat
+4
8.02M
3w
NVIDIA
Free Endpoint
cosmos-nemotron-34b
Multi-modal vision-language model that understands text/img/video and creates informative responses
Model
VLM
+3
6
1y
Google
Free Endpoint
paligemma
Vision language model adept at comprehending text and visual inputs to produce informative responses
Model
image
+8
335K
1y
NVIDIA
Free Endpoint
retail-object-detection
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Model
Object Detection
+8
363
1y
NVIDIA
Free Endpoint
visual-changenet
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
Model
image
+8
640
1y
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+4
17.73M
1mo
Z.ai
Downloadable
glm-5
GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
Model
chat
+3
9.8M
4w
NVIDIA
Free Endpoint
ocdrnet
OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.
Model
Optical Character Recognition
+8
736
1y
NVIDIA
Downloadable
nemotron-nano-12b-v2-vl
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Model
chat
+4
1.4M
4mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-vl-8b-v1
Multi-modal vision-language model that understands text/img and creates informative responses
Model
chat
+3
9.15M
8mo
NVIDIA
Downloadable
llama-nemotron-embed-vl-1b-v2
Multimodal question-answer retrieval representing user queries as text and documents as images.
Model
nemo retriever
+3
883K
1mo
Mistral AI
Downloadable
ministral-14b-instruct-2512
A general purpose VLM ideal for chat and instruction based use cases
Model
chat
+4
4.67M
3mo
NVIDIA
Downloadable
cosmos-reason1-7b
Reasoning vision language model (VLM) for physical AI and robotics.
Model
video understanding
+8
15.93K
7mo
NVIDIA
Downloadable
nemoguard-jailbreak-detect
Industry leading jailbreak classification model for protection from adversarial attempts
Model
nemo guardrails
+6
79.59K
8mo
Meta
Free Endpoint
llama-guard-4-12b
Multi-modal model to classify safety for input prompts as well output responses.
Model
LLM Multimodal Safety
+3
495K
8mo
Mistral AI
Free Endpoint
mistral-large-3-675b-instruct-2512
A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
Model
chat
+4
6.69M
3mo
TokyoTech-LLM
Downloadable
llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
Model
chat
+3
539K
9mo
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-content-safety
Leading content safety model for enhancing the safety and moderation capabilities of LLMs
Model
nemo guardrails
+4
574K
11mo
NVIDIA
Downloadable
llama-3.1-nemoguard-8b-topic-control
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
Model
nemo guardrails
+4
549K
11mo
NVIDIA
Free Endpoint
llama-3.1-nemotron-safety-guard-8b-v3
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
Model
content moderation
+4
607K
4mo
MediaTek
Free Endpoint
breeze-7b-instruct
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
Model
chat
+3
579K
9mo
Igenius
Free Endpoint
colosseum_355b_instruct_16k
NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
Model
chat
+4
81.66K
9mo
DeepSeek AI
Free Endpoint
deepseek-v3.1-terminus
DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
Model
chat
+4
13.24M
5mo
Items per page
24
1
1
2
2
3
3
of 3 pages