nvidia/llama-3.2-nv-rerankqa-1b-v1
Efficiently refine retrieval results over multiple sources and languages.
nvidia/llama-3.2-nv-embedqa-1b-v1
World-class multilingual and cross-lingual question-answering retrieval.
A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification
shutterstock/edify-360-hdri
Shutterstock Generative 3D service for 360 HDRi generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries.
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.
zyphra/zamba2-7b-instruct
Efficient hybrid state-space model designed for conversational and reasoning tasks.
institute-of-science-tokyo/llama-3.1-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Cutting-edge vision-language model exceling in high-quality reasoning from images.
Cutting-edge vision-Language model exceling in high-quality reasoning from images.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Unique language model that delivers an unmatched accuracy-efficiency performance.
qwen/qwen2-7b-instruct
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
abacusai/dracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
yentinglin/llama-3-taiwan-70b-instruct
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
tokyotech-llm/llama-3-swallow-70b-instruct-v0.1
Sovereign AI model trained on Japanese language that understands regional nuances.
microsoft/phi-3.5-vision-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
ai21labs/jamba-1.5-mini-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
ai21labs/jamba-1.5-large-instruct
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
nvidia/nemotron-mini-4b-instruct
Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
microsoft/phi-3.5-moe-instruct
Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation
microsoft/phi-3.5-mini-instruct
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
rakuten/rakutenai-7b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
rakuten/rakutenai-7b-chat
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
ipd/proteinmpnn
ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.
microsoft/florence-2
Vision foundation model capable of performing diverse computer vision and vision language tasks.
google/gemma-2-2b-it
Advanced small language generative AI model for edge applications
nvidia/usdcode-llama3-70b-instruct
State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
Shutterstock/edify-3d
Shutterstock Generative 3D service for 3D asset generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries
GettyImages/edify-image
Getty Images’ API service for 4K image generation. Trained on NVIDIA Edify using Getty Images' commercially safe creative libraries.
thudm/chatglm3-6b
Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
mistralai/mamba-codestral-7b-v0.1
Model for writing and interacting with code across a wide range of programming languages and tasks.
baichuan-inc/baichuan2-13b-chat
Support Chinese and English chat, coding, math, instruction following, solving quizzes
Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.
Powers complex conversations with superior contextual understanding, reasoning and text generation.
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
nv-mistralai/mistral-nemo-12b-instruct
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
nvidia/nv-rerankqa-mistral-4b-v3
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
nvidia/nv-embedqa-e5-v5
GPU-accelerated generation of text embeddings used for question-answering retrieval.
nvidia/nv-embedqa-mistral-7b-v2
GPU-accelerated generation of text embeddings used for question-answering retrieval.
nvidia/maisi
MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
deepseek-ai/deepseek-coder-6.7b-instruct
Powerful coding model offering advanced capabilities in code generation, completion, and infilling
microsoft/phi-3-medium-128k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
bigcode/starcoder2-7b
Advanced programming model for code completion, summarization, and generation
bigcode/starcoder2-15b
Advanced programming model for code completion, summarization, and generation
google/gemma-2-27b-it
Cutting-edge text generation model text understanding, transformation, and code generation.
google/gemma-2-9b-it
Cutting-edge text generation model text understanding, transformation, and code generation.
nvidia/nemotron-4-340b-reward
Grades responses on five attributes helpfulness, correctness, coherence, complexity and verbosity.
nvidia/nemotron-4-340b-instruct
Creates diverse synthetic data that mimics the characteristics of real-world data.
mistralai/mistral-7b-instruct-v0.3
This LLM follows instructions, completes requests, and generates creative text.
upstage/solar-10.7b-instruct
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
baai/bge-m3
Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.
mediatek/breeze-7b-instruct
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
google/codegemma-1.1-7b
Advanced programming model for code generation, completion, reasoning, and instruction following.
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.
ipd/rfdiffusion
A generative model of protein backbones for protein binder design.
microsoft/phi-3-small-8k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
microsoft/phi-3-small-128k-instruct
Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
microsoft/phi-3-medium-4k-instruct
Cutting-edge lightweight open language model exceling in high-quality reasoning.
microsoft/phi-3-vision-128k-instruct
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
aisingapore/sea-lion-7b-instruct
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
microsoft/phi-3-mini-4k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
databricks/dbrx-instruct
A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.
snowflake/arctic
Delivers high efficiency inference for enterprise applications focused on SQL generation and coding.
snowflake/arctic-embed-l
GPU-accelerated generation of text embeddings.
microsoft/phi-3-mini-128k-instruct
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
meta/llama3-70b-instruct
Powers complex conversations with superior contextual understanding, reasoning and text generation.
meta/llama3-8b-instruct
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
google/recurrentgemma-2b
Novel recurrent architecture based language model for faster inference when generating long sequences.
google/codegemma-7b
Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.
google/gemma-2b
Lightweight language model deployable on laptop, desktop or the cloud for summarization and reasoning.
nvidia/embed-qa-4
GPU-accelerated generation of text embeddings used for question-answering retrieval.
nvidia/rerank-qa-mistral-4b
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
google/gemma-7b
Cutting-edge text generation model text understanding, transformation, and code generation.
meta/llama2-70b
Cutting-edge large language AI model capable of generating text and code in response to prompts.
mistralai/mistral-7b-instruct-v0.2
This LLM follows instructions, completes requests, and generates creative text.
nvidia/molmim
MolMIM performs controlled generation, finding molecules with the right properties.