
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.

A GenAI system that enhances and localizes product catalogs with rich text content and imagery.

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use

Open Mixture of Experts LLM (230B, 10B active) for reasoning, coding, and tool-use/agent workflows

Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Excels in agentic coding and browser use and supports 256K context, delivering top results.

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.

Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.

Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

Improve safety, security, and privacy of AI systems at build, deploy and run stages.

A general purpose multimodal, multilingual 128 MoE model with 17B parameters.

A multimodal, multilingual 16 MoE model with 17B parameters.

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

High performance reasoning model optimized for efficiency and edge deployment

Industry leading jailbreak classification model for protection from adversarial attempts

Multi-modal model to classify safety for input prompts as well output responses.

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry

Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses

Transform PDFs into AI podcasts for engaging on-the-go audio content.

High accuracy and optimized performance for transcription in 25 languages

Robust Speech Recognition via Large-Scale Weak Supervision.

Leading content safety model for enhancing the safety and moderation capabilities of LLMs

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

Advanced AI model detects faces and identifies deep fake images.

Robust image classification model for detecting and managing AI-generated content.

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.