
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.

A GenAI system that enhances and localizes product catalogs with rich text content and imagery.

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

A state-of-the-art general-purpose MoE VLM ideal for chat, agentic, and instruction-based use cases.

A general-purpose VLM ideal for chat and instruction-based use cases

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within an 80GB GPU.

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities

An MoE LLM that follows instructions, completes requests, and generates creative text.

An MoE LLM that follows instructions, completes requests, and generates creative text.

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

Supports Chinese and English for tasks including chat, content generation, coding, and translation.

An edge computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments

An edge computing AI model that accepts text, audio, and image input, ideal for resource-constrained environments

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

Multi-modal vision-language model that understands text and images and creates informative responses

Build a data flywheel with NVIDIA NeMo microservices that continuously optimizes AI agents for latency and cost while maintaining accuracy targets.

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

SOTA LLM pre-trained for instruction following and proficiency in the Indonesian language and its dialects.

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

Powers complex conversations with superior contextual understanding, reasoning and text generation.

This LLM follows instructions, completes requests, and generates creative text.

Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

Cutting-edge text generation model for text understanding, transformation, and code generation.

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

Cutting-edge text generation model for text understanding, transformation, and code generation.

A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications

Advanced small language generative AI model for edge applications

Supports Chinese and English chat, coding, math, instruction following, and quiz solving

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Long-context, cutting-edge, lightweight open language model excelling in high-quality reasoning.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Chinese and English LLM targeting language, coding, mathematics, reasoning, and more.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Chinese and English LLM targeting language, coding, mathematics, reasoning, and more.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Sovereign AI model fine-tuned on Traditional Mandarin and English data using the Llama-3 architecture.

Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.

Sovereign AI model trained on Japanese language that understands regional nuances.

Sovereign AI model trained on Japanese language that understands regional nuances.

Sovereign AI model trained on Japanese language that understands regional nuances.

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

Multi-modal vision-language model that understands text, images, and video and creates informative responses

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

Guardrail model to ensure that responses from LLMs are appropriate and safe

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

Optimized SLM for on-device inference, fine-tuned for roleplay, RAG, and function calling

State-of-the-art small language model delivering superior accuracy for chatbots, virtual assistants, and content generation.