
A context‑aware safety model that applies reasoning to enforce domain‑specific policies.

An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

Distill and deploy domain-specific AI models from unstructured financial data to generate market signals efficiently—scaling your workflow with the NVIDIA Data Flywheel Blueprint for high-performance, cost-efficient experimentation.

Accelerate post-training of end-to-end autonomous vehicle stacks with vector search and retrieval for large video datasets.

Japanese-specialized large-language-model for enterprises to read and understand complex business documents.

80B parameter AI model with hybrid reasoning, MoE architecture, support for 119 languages.

Transform your scene idea into ready-to-use 3D assets using Llama 3.1 8B, NV SANA, and Microsoft TRELLIS

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

Stable Diffusion 3.5 is a popular text-to-image generation model

Smaller Mixture of Experts (MoE) text-only LLM for efficient AI reasoning and math

Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.

Accurate and optimized English transcriptions with punctuation and word timestamps

Sensor-captured radio enables real-time awareness, AI-driven analytics for actionable, searchable insights.

Multilingual 7B LLM, instruction-tuned on all 24 EU languages for stable, culturally aligned output.

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

An MOE LLM that follows instructions, completes requests, and generates creative text.

An MOE LLM that follows instructions, completes requests, and generates creative text.

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

Use the multi-LLM compatible NIM container to deploy a broad range of LLMs from Hugging Face.

Expressive and engaging text-to-speech, generated from a short audio sample.

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

Multi-modal model to classify safety for input prompts as well output responses.

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.

Removes unwanted noises from audio improving speech intelligibility.

Enhance speech by correcting common audio degradations to create studio quality speech output.

Advanced LLM for reasoning, math, general knowledge, and function calling

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Expressive and engaging text-to-speech, generated from a short audio sample.

This LLM follows instructions, completes requests, and generates creative text.

Route LLM requests to the best model for the task at hand.

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.

Cutting-edge text generation model text understanding, transformation, and code generation.

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

Cutting-edge text generation model text understanding, transformation, and code generation.

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

Advanced small language generative AI model for edge applications

Support Chinese and English chat, coding, math, instruction following, solving quizzes

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Long context cutting-edge lightweight open language model exceling in high-quality reasoning.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Cutting-edge lightweight open language model exceling in high-quality reasoning.

Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.

Cutting-edge lightweight open language model exceling in high-quality reasoning.

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

Cutting-edge lightweight open language model exceling in high-quality reasoning.

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.

This LLM follows instructions, completes requests, and generates creative text.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.

Easily run essential genomics workflows to save time leveraging Parabricks

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Multi-lingual model supporting speech-to-text recognition and translation.

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.

Estimate gaze angles of a person in a video and redirect to make it frontal.

Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Multi-modal vision-language model that understands text/img/video and creates informative responses

Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments

Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior

Guardrail model to ensure that responses from LLMs are appropriate and safe

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

Leaderboard topping reward model supporting RLHF for better alignment with human preferences.

Advanced text-to-image model for generating high quality images

Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.