
Built for agentic workflows, this model excels in coding, instruction following, and function calling

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

Multilingual text reranking model.

Multilingual text question-answering retrieval, transforming textual information into dense vector representations.

This LLM follows instructions, completes requests, and generates creative text.

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

This LLM follows instructions, completes requests, and generates creative text.