Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
Model for writing and interacting with code across a wide range of programming languages and tasks.
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
This LLM follows instructions, completes requests, and generates creative text.
An MOE LLM that follows instructions, completes requests, and generates creative text.
This LLM follows instructions, completes requests, and generates creative text.
An MOE LLM that follows instructions, completes requests, and generates creative text.