Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.
Streamline evaluation, monitoring, and optimization of AI data flywheel with Weights & Biases.
Orchestrate AI agents for data flywheel with MLRun and NVIDIA NeMo microservices.
Improve safety, security, and privacy of AI systems at build, deploy and run stages.
Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.
Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.
State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following
FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
Design, test, and optimize a new generation of intelligence manufacturing data centers using digital twins.
State-of-the-art, multilingual model tailored to all 24 official European Union languages.
SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
Create high quality images using Flux.1 in ComfyUI, guided by 3D.
Build artificial general agents (AGA) powered by AGI models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.
Develop AI powered weather analysis and forecasting application visualizing multi-layered geospatial data.
Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.
Generalist model to generate future world state as videos from text and image prompts to create synthetic training data for robots and autonomous vehicles.
Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.
End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.
Leading reasoning and agentic AI accuracy model for PC and edge.
The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Route LLM requests to the best model for the task at hand.
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
Transform PDFs into AI podcasts for engaging on-the-go audio content.
Trace and evaluate AI Agents with Weights & Biases.
Automate voice AI agents with NVIDIA NIM microservices and Pipecat.
Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.
Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM
Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.
Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
FourCastNet predicts global atmospheric dynamics of various weather / climate variables.
Advanced AI model detects faces and identifies deep fake images.
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.
Create intelligent virtual assistants for customer service across every industry
Rapidly identify and mitigate container security vulnerabilities with generative AI.
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI model trained on Japanese language that understands regional nuances.
Unique language model that delivers an unmatched accuracy-efficiency performance.
Robust image classification model for detecting and managing AI-generated content.
Create intelligent, interactive avatars for customer service across industries
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
Sovereign AI model trained on Japanese language that understands regional nuances.
This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
Grounding dino is an open vocabulary zero-shot object detection model.
Vision foundation model capable of performing diverse computer vision and vision language tasks.
Advanced small language generative AI model for edge applications
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
Multilingual text reranking model.
English text embedding model for question-answering retrieval.
Multilingual text question-answering retrieval, transforming textual information into dense vector representations.
Generates high-quality numerical embeddings from text inputs.
Visual Changenet detects pixel-level change maps between two images and outputs a semantic change segmentation mask
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
An MOE LLM that follows instructions, completes requests, and generates creative text.
Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.
An MOE LLM that follows instructions, completes requests, and generates creative text.