Distilled version of Qwen 2.5 32B, trained on reasoning data generated by DeepSeek R1 to improve reasoning performance.
Distilled version of Qwen 2.5 14B, trained on reasoning data generated by DeepSeek R1 to improve reasoning performance.
Distilled version of Qwen 2.5 7B, trained on reasoning data generated by DeepSeek R1 to improve reasoning performance.
Chinese and English LLM targeting language, coding, mathematics, reasoning, and related tasks.
Advanced LLM for code generation, code reasoning, and code fixing across popular programming languages.
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
Chinese and English LLM targeting language, coding, mathematics, reasoning, and related tasks.