Built for agentic workflows, this model excels in coding, instruction following, and function calling.
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast responses.
Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
Latency-optimized language model excelling in code, math, general knowledge, and instruction following.
State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.
State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.
Most advanced language model for reasoning, code, and multilingual tasks; runs on a single GPU.
Multilingual text reranking model.
Multilingual model for text question-answering retrieval that transforms textual information into dense vector representations.
This LLM follows instructions, completes requests, and generates creative text.
GPU-accelerated model that outputs a probability score indicating whether a given passage contains the information needed to answer a question.
This LLM follows instructions, completes requests, and generates creative text.