Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.
Streamline evaluation, monitoring, and optimization of AI data flywheel with Weights & Biases.
Orchestrate AI agents for data flywheel with MLRun and NVIDIA NeMo microservices.
Improve safety, security, and privacy of AI systems at build, deploy and run stages.
Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.
State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following
FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
Design, test, and optimize a new generation of intelligence manufacturing data centers using digital twins.
State-of-the-art, multilingual model tailored to all 24 official European Union languages.
SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
Create high quality images using Flux.1 in ComfyUI, guided by 3D.
Build artificial general agents (AGA) powered by AGI models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.
Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.
Run computational-fluid dynamics (CFD) simulations
Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.
Generalist model to generate future world state as videos from text and image prompts to create synthetic training data for robots and autonomous vehicles.
Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.
End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.
Leading reasoning and agentic AI accuracy model for PC and edge.
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Route LLM requests to the best model for the task at hand.
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
Transform PDFs into AI podcasts for engaging on-the-go audio content.
Trace and evaluate AI Agents with Weights & Biases.
Automate voice AI agents with NVIDIA NIM microservices and Pipecat.
Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.
Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM
Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.
Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
This NVIDIA Omniverseâ„¢ Blueprint demonstrates how commercial software vendors can create interactive digital twins.
FourCastNet predicts global atmospheric dynamics of various weather / climate variables.
Advanced AI model detects faces and identifies deep fake images.
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.
Create intelligent virtual assistants for customer service across every industry
Rapidly identify and mitigate container security vulnerabilities with generative AI.
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI model trained on Japanese language that understands regional nuances.
Unique language model that delivers an unmatched accuracy-efficiency performance.
Robust image classification model for detecting and managing AI-generated content.
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
Sovereign AI model trained on Japanese language that understands regional nuances.
This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
Grounding dino is an open vocabulary zero-shot object detection model.
Advanced small language generative AI model for edge applications
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
An MOE LLM that follows instructions, completes requests, and generates creative text.
Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.
An MOE LLM that follows instructions, completes requests, and generates creative text.