Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
43 results for
Filters (1)
Models (43)
Blueprints (0)
Other (0)
Sort By
score:DESC
Best Match
NVIDIA
Downloadable
cosmos-reason2-8b
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Model
video understanding
+8
124K
3mo
DeepSeek AI
Downloadable
deepseek-r1-distill-llama-8b
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
Distillation
+5
2.31M
8mo
DeepSeek AI
Downloadable
deepseek-r1-distill-qwen-14b
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
coding
+4
1.88K
2.17M
10mo
DeepSeek AI
Downloadable
deepseek-r1-distill-qwen-32b
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
coding
+4
2.49K
2.54M
10mo
DeepSeek AI
Free Endpoint
deepseek-v3.1
DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
Model
chat
+2
11.34M
7mo
DeepSeek AI
Free Endpoint
deepseek-v3.1-terminus
DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
Model
chat
+4
13.2M
5mo
DeepSeek AI
Free Endpoint
deepseek-v3.2
State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
Model
chat
+3
15.74M
3mo
Mistral AI
Free Endpoint
devstral-2-123b-instruct-2512
State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
Model
coding
+4
4.65M
3mo
Tiiuae
Free Endpoint
falcon3-7b-instruct
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Model
chat
+6
1.74M
10mo
Google
Downloadable
gemma-4-31b-it
Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
Model
coding
+4
1d
Z.ai
Free Endpoint
glm-4.7
GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
Model
Tool Calling
+4
14.39M
2mo
Z.ai
Downloadable
glm-5
GLM-5 744B MoE enables efficient reasoning for complex systems and long-horizon agentic tasks.
Model
MoE
+3
35.45M
1mo
OpenAI
Downloadable
gpt-oss-120b
Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
Model
reasoning
+4
44.86M
8mo
OpenAI
Downloadable
gpt-oss-20b
Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
Model
reasoning
+4
8.25M
8mo
IBM
Free Endpoint
granite-3.3-8b-instruct
Small language model fine-tuned for improved reasoning, coding, and instruction-following
Model
coding
+3
78.41K
8mo
Moonshotai
Free Endpoint
kimi-k2-instruct
State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
Model
coding
+4
20.61M
8mo
Moonshotai
Free Endpoint
kimi-k2-instruct-0905
Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
Model
long-context
+4
14M
6mo
Moonshotai
Free Endpoint
kimi-k2-thinking
Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
Model
Conversational
+4
3.55M
3mo
Moonshotai
Downloadable
kimi-k2.5
1T multimodal MoE for high‑capacity video and image understanding with efficient inference.
Model
Multimodal
+4
42.89M
2mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-4b-v1.1
State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
Model
chat
+4
104K
9mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.
Model
chat
+4
358K
9mo
NVIDIA
Downloadable
llama-3.1-nemotron-ultra-253b-v1
Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
Model
chat
+4
5.84M
8mo
Meta
Downloadable
llama-3.3-70b-instruct
Advanced LLM for reasoning, math, general knowledge, and function calling
Model
Instruction following
+5
16.65M
9mo
NVIDIA
Downloadable
llama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Model
chat
+4
886K
8mo
Items per page
24
1
1
2
2
of 2 pages