
An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.

A state-of-the-art general-purpose MoE VLM ideal for chat, agentic, and instruction-based use cases.

A general-purpose VLM ideal for chat and instruction-based use cases.

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.

Transform your scene idea into ready-to-use 3D assets using Llama 3.1 8B, NV SANA, and Microsoft TRELLIS

Elevate Shopping Experiences Online and In Stores.

Stable Diffusion 3.5 is a popular text-to-image generation model

FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.

ProteinMPNN is a deep learning model for predicting amino acid sequences for protein backbones.


This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.

Lightweight reasoning model for applications in latency-bound, memory- and compute-constrained environments


A general-purpose multimodal, multilingual 128-expert MoE model with 17B parameters.

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

A multimodal, multilingual 16-expert MoE model with 17B parameters.

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.

Create intelligent virtual assistants for customer service across every industry

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.


Design, test, and optimize a new generation of intelligence manufacturing data centers using digital twins.

Advanced LLM for code generation, reasoning, and fixing across popular programming languages.

SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

Generates a multiple sequence alignment from a query sequence and a protein sequence database search.

FLUX.1 is a state-of-the-art suite of image generation models

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds


Powers complex conversations with superior contextual understanding, reasoning and text generation.

Built for agentic workflows, this model excels in coding, instruction following, and function calling

This LLM follows instructions, completes requests, and generates creative text.

Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint

A generative model of protein backbones for protein binder design.

Cutting-edge open multimodal model excelling in high-quality reasoning from images.

Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities

Cutting-edge text generation model for text understanding, transformation, and code generation.

LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.

Cutting-edge text generation model for text understanding, transformation, and code generation.

A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications

Advanced small language generative AI model for edge applications

Supports Chinese and English chat, coding, math, instruction following, and quiz solving

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

Chinese and English LLM targeting language, coding, mathematics, and reasoning tasks.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Chinese and English LLM targeting language, coding, mathematics, and reasoning tasks.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.

Cutting-edge lightweight open language model excelling in high-quality reasoning.

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

A bilingual Hindi-English SLM for on-device inference, tailored specifically for the Hindi language.

This LLM follows instructions, completes requests, and generates creative text.

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

Model for writing and interacting with code across a wide range of programming languages and tasks.

Powers complex conversations with superior contextual understanding, reasoning and text generation.

Sovereign AI model fine-tuned on Traditional Mandarin and English data using the Llama-3 architecture.

Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.

Sovereign AI model trained on Japanese language that understands regional nuances.

Sovereign AI model trained on Japanese language that understands regional nuances.

Sovereign AI model trained on Japanese language that understands regional nuances.

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM.

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast responses

Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.

Predicts the 3D structure of a protein from its amino acid sequence.

Predicts the 3D structure of a protein from its amino acid sequence.


Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.


Cutting-edge open multimodal model excelling in high-quality reasoning from images.

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

Advanced programming model for code completion, summarization, and generation

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

Optimized SLM for on-device inference, fine-tuned for roleplay, RAG, and function calling

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.