Built for agentic workflows, this model excels in coding, instruction following, and function calling
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
Multilingual text reranking model.
Multilingual text question-answering retrieval, transforming textual information into dense vector representations.
This LLM follows instructions, completes requests, and generates creative text.
An MOE LLM that follows instructions, completes requests, and generates creative text.
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
This LLM follows instructions, completes requests, and generates creative text.
An MOE LLM that follows instructions, completes requests, and generates creative text.