High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
Reasoning vision language model (VLM) for physical AI and robotics.
Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math
Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Lightweight reasoning model for applications in latency-bound, memory- and compute-constrained environments
State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities
High performance reasoning model optimized for efficiency and edge deployment
Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.
State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.
Small language model fine-tuned for improved reasoning, coding, and instruction-following
Advanced reasoning MoE model excelling at reasoning, multilingual tasks, and instruction following
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
Superior inference efficiency with highest accuracy for scientific and complex math reasoning, coding, tool calling, and instruction following.
Build a custom deep researcher powered by state-of-the-art models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Model with leading reasoning and agentic AI accuracy for PC and edge.
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Chinese and English LLM targeting language, coding, mathematics, reasoning, etc.
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
Advanced LLM for reasoning, math, general knowledge, and function calling
Efficient hybrid state-space model designed for conversational and reasoning tasks.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Cutting-edge vision-language model excelling in high-quality reasoning from images.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Chinese and English LLM targeting language, coding, mathematics, reasoning, etc.
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
Cutting-edge lightweight open language model excelling in high-quality reasoning.
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
Cutting-edge lightweight open language model excelling in high-quality reasoning.
Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.
Cutting-edge lightweight open language model excelling in high-quality reasoning.
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
An MoE LLM that follows instructions, completes requests, and generates creative text.
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
An MoE LLM that follows instructions, completes requests, and generates creative text.