nvidia/llama-3.1-8b-aegis-v2
Guardrail model to ensure that responses from LLMs are appropriate and safe
nvidia/llama-3.1-nemotron-70b-instruct
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.
qwen/qwen2-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
tokyotech-llm/llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
ai21labs/jamba-1.5-mini-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
ai21labs/jamba-1.5-large-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
microsoft/phi-3.5-moe-instruct
Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation
microsoft/phi-3.5-mini-instruct
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
rakuten/rakutenai-7b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
rakuten/rakutenai-7b-chat
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
writer/palmyra-fin-70b-32k
Specialized LLM for financial analysis, reporting, and data processing
google/shieldgemma-9b
Guardrail model to ensure that responses from LLMs are appropriate and safe
nvidia/usdcode-llama3-70b-instruct
State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
meta/llama-3.1-405b-instruct
Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
nvidia/llama3-chatqa-1.5-70b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
nvidia/llama3-chatqa-1.5-8b
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
mistralai/mistral-7b-instruct-v0.3
This LLM follows instructions, completes requests, and generates creative text.
writer/palmyra-med-70b-32k
Leading LLM for accurate, contextually relevant responses in the medical domain.
writer/palmyra-med-70b
Leading LLM for accurate, contextually relevant responses in the medical domain.
mediatek/breeze-7b-instruct
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
ibm/granite-34b-code-instruct
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
ibm/granite-8b-code-instruct
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
aisingapore/sea-lion-7b-instruct
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
microsoft/phi-3-mini-4k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
databricks/dbrx-instruct
A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.
microsoft/phi-3-mini-128k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
mistralai/mixtral-8x22b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.
meta/llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
meta/codellama-70b
LLM capable of generating code from natural language and vice versa.
mistralai/mistral-7b-instruct-v0.2
This LLM follows instructions, completes requests, and generates creative text.
mistralai/mixtral-8x7b-instruct-v0.1
An MOE LLM that follows instructions, completes requests, and generates creative text.