Explore
Models
Blueprints
GPUs
Docs
⌘K
Ctrl+K
?
Login
70 results for
Filters
Models (70)
Blueprints (0)
Other (0)
Sort By
score:DESC
Best Match
NVIDIA
Free Endpoint
nemotron-content-safety-reasoning-4b
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
Model
NeMo Guardrails
+3
207K
2mo
Microsoft
Deprecated
Free Endpoint
phi-4-mini-flash-reasoning
Lightweight reasoning model for applications in latency bound, memory/compute constrained environments
Model
edge
+3
162K
9mo
IBM
Deprecated
Free Endpoint
granite-3.3-8b-instruct
Small language model fine-tuned for improved reasoning, coding, and instruction-following
Model
coding
+2
9mo
Mistral AI
Free Endpoint
magistral-small-2506
High performance reasoning model optimized for efficiency and edge deployment
Model
coding
+3
1.18M
9mo
Moonshotai
Free Endpoint
kimi-k2-instruct
State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
Model
coding
+3
21.04M
9mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-8b-v1
Leading reasoning and agentic AI accuracy model for PC and edge.
Model
math
+3
622K
9mo
Marin
Deprecated
Free Endpoint
marin-8b-instruct
State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.
Model
Reasoning
+3
148K
10mo
NVIDIA
Downloadable
nvidia-nemotron-nano-9b-v2
High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
Model
thinking budget
+1
285K
7mo
Qwen
Downloadable
qwen3-next-80b-a3b-thinking
80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.
Model
Reasoning
+1
1.86M
7mo
ByteDance
Free Endpoint
seed-oss-36b-instruct
ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
Model
thinking budget
+2
1.36M
7mo
Stepfun-ai
Free Endpoint
step-3.5-flash
200B open-source reasoning engine with sparse MoE powering frontier agentic AI.
Model
Agentic
+2
9.4M
2mo
NVIDIA
Downloadable
llama-3.1-nemotron-nano-4b-v1.1
State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
Model
edge
+3
82.25K
9mo
NVIDIA
Deprecation in 7d
Downloadable
llama-3.1-nemotron-ultra-253b-v1
Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
Model
math
+3
5.15M
9mo
NVIDIA
Free Endpoint
ising-calibration-1-35b-a3b
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
Model
Quantum
+3
Today
Mistral AI
Deprecated
Downloadable
mistral-small-24b-instruct
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
Model
code
+3
198K
9mo
Sarvamai
Downloadable
sarvam-m
Multilingual, hybrid-reasoning model optimized for Indian language tasks, programming, mathematical reasoning capabilities.
Model
coding
+5
134K
8mo
Qwen
Deprecated
Free Endpoint
qwq-32b
Powerful reasoning model capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems.
Model
coding
+3
1.13M
9mo
DeepSeek AI
Deprecated
Downloadable
deepseek-r1-distill-llama-8b
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
Distillation
+4
1.46M
9mo
DeepSeek AI
Deprecated
Downloadable
deepseek-r1-distill-qwen-14b
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
coding
+3
2.45K
1.29M
10mo
DeepSeek AI
Deprecated
Downloadable
deepseek-r1-distill-qwen-32b
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model
coding
+3
43.07K
1.74M
10mo
DeepSeek AI
Free Endpoint
deepseek-v3.2
State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
Model
long context
+2
14.89M
4mo
Mistral AI
Free Endpoint
devstral-2-123b-instruct-2512
State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
Model
coding
+3
3.62M
4mo
Tiiuae
Deprecated
Free Endpoint
falcon3-7b-instruct
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Model
Coding
+5
696K
10mo
Google
Downloadable
gemma-4-31b-it
Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
Model
coding
+3
1.47M
1w
Items per page
24
1
1
2
2
3
3
of 3 pages