Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
66 results for
Filters (1)
Models (65)
Blueprints (1)
Other (0)
Sort By
score:DESC
Best Match
NVIDIA
Launchable
Enterprise
Build a Video Search and Summarization (VSS) Agent
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Blueprint
NVIDIA AI
+4
2mo
Items per page
24
1
1
2
2
3
3
of 3 pages
DeepSeek AI
Deprecation in 3d
Free Endpoint
deepseek-v3.1-terminus
DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
Model
tool calling
+3
5.7M
6mo
DeepSeek AI
Deprecation in 3d
Free Endpoint
deepseek-v3.2
State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
Model
long context
+2
7.69M
4mo
DeepSeek AI
Downloadable
deepseek-v4-flash
DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
Model
coding
+3
1.86M
1w
DeepSeek AI
Downloadable
deepseek-v4-pro
DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.
Model
Moe
+3
2.08M
1w
Mistral AI
Deprecation in 10d
Free Endpoint
devstral-2-123b-instruct-2512
State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
Model
coding
+3
2.62M
4mo
Abacus.AI
Free Endpoint
dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
Model
Code Generation
+1
433K
11mo
Google
Free Endpoint
gemma-2-2b-it
Advanced small language generative AI model for edge applications
Model
Chat
+3
478K
11mo
Google
Deprecation in 11d
Free Endpoint
gemma-3-27b-it
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Model
Vision Assistant
+3
4.07M
11mo
Google
Free Endpoint
gemma-3n-e2b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
Model
language generation
+3
395K
9mo
Google
Free Endpoint
gemma-3n-e4b-it
An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
Model
language generation
+3
1.3M
9mo
Google
Downloadable
gemma-4-31b-it
Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
Model
coding
+3
4.54M
1mo
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+3
7.38M
2w
Z.ai
Downloadable
glm-5.1
GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
Model
Agentic AI
+3
6.33M
2w
OpenAI
Downloadable
gpt-oss-120b
Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
Model
reasoning
+3
27.85M
9mo
OpenAI
Downloadable
gpt-oss-20b
Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
Model
reasoning
+3
11.37M
9mo
NVIDIA
Downloadable
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Model
Quantum
+3
149K
2w
Moonshotai
Deprecation in 11d
Free Endpoint
kimi-k2-instruct
State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
Model
coding
+3
12.72M
9mo
Moonshotai
Deprecation in 4d
Free Endpoint
kimi-k2-instruct-0905
Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
Model
long-context
+3
7.75M
7mo
Moonshotai
Deprecation in 11d
Free Endpoint
kimi-k2-thinking
Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
Model
Conversational
+3
2.82M
4mo
Moonshotai
Downloadable
kimi-k2.6
1T multimodal MoE for long-horizon coding, agentic tool use, and image/video understanding.
Model
Multimodal
+3
102K
1d
Meta
Downloadable
llama-3.1-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Model
Chat
+3
2.3M
10mo
Meta
Downloadable
llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
Model
Chat
+4
14.29M
9mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.
Model
math
+3
999K
10mo