A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
Advanced Small Language Model supporting RAG, summarization, classification, code, and agentic AI
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.
zyphra/zamba2-7b-instruct
Efficient hybrid state-space model designed for conversational and reasoning tasks.
institute-of-science-tokyo/llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Unique language model that delivers an unmatched accuracy-efficiency performance.
qwen/qwen2-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
yentinglin/llama-3-taiwan-70b-instruct
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
tokyotech-llm/llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
ai21labs/jamba-1.5-mini-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
ai21labs/jamba-1.5-large-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
nvidia/nemotron-mini-4b-instruct
Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
microsoft/phi-3.5-moe-instruct
Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation
microsoft/phi-3.5-mini-instruct
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
rakuten/rakutenai-7b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
rakuten/rakutenai-7b-chat
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
google/gemma-2-2b-it
Advanced small language generative AI model for edge applications
nvidia/usdcode-llama3-70b-instruct
State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
thudm/chatglm3-6b
Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
baichuan-inc/baichuan2-13b-chat
Support Chinese and English chat, coding, math, instruction following, solving quizzes
Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
nv-mistralai/mistral-nemo-12b-instruct
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
microsoft/phi-3-medium-128k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
google/gemma-2-27b-it
Cutting-edge text generation model text understanding, transformation, and code generation.
google/gemma-2-9b-it
Cutting-edge text generation model text understanding, transformation, and code generation.
nvidia/llama3-chatqa-1.5-70b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
nvidia/llama3-chatqa-1.5-8b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
01-ai/yi-large
Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.
nvidia/nemotron-4-340b-instruct
Creates diverse synthetic data that mimics the characteristics of real-world data.
mistralai/mistral-7b-instruct-v0.3
This LLM follows instructions, completes requests, and generates creative text.
upstage/solar-10.7b-instruct
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
mediatek/breeze-7b-instruct
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
microsoft/phi-3-small-8k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
microsoft/phi-3-small-128k-instruct
Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
microsoft/phi-3-medium-4k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
aisingapore/sea-lion-7b-instruct
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
microsoft/phi-3-mini-4k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
databricks/dbrx-instruct
A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.
snowflake/arctic
Delivers high efficiency inference for enterprise applications focused on SQL generation and coding.
microsoft/phi-3-mini-128k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
mistralai/mixtral-8x22b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.
meta/llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
meta/llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
google/recurrentgemma-2b
Novel recurrent architecture based language model for faster inference when generating long sequences.
google/codegemma-7b
Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.
google/gemma-2b
Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.
google/gemma-7b
Cutting-edge text generation model text understanding, transformation, and code generation.
meta/codellama-70b
LLM capable of generating code from natural language and vice versa.
meta/llama2-70b
Cutting-edge large language AI model capable of generating text and code in response to prompts.
mistralai/mixtral-8x7b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.