Create AI agents that reason, plan, reflect and refine to produce high-quality reports based on source materials of your choice.
Investigate, understand, and interpret single cell data in minutes, not days by leveraging RAPIDS-singlecell, powered by NVIDIA RAPIDS
Easily run essential genomics workflows to save time leveraging Parabricks
Run computational-fluid dynamics (CFD) simulations
Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.
Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Cutting-edge vision-language model exceling in retrieving text and metadata from images.
Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Connect AI applications to multimodal enterprise data with a retrieval augmented generation (RAG) pipeline.
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
Leading content safety model for enhancing the safety and moderation capabilities of LLMs
NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry
Transform PDFs into AI podcasts for engaging on-the-go audio content.
Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM
Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Automatic speech recognition model that transcribes speech in lower case English with record-setting accuracy and performance
FourCastNet predicts global atmospheric dynamics of various weather / climate variables.
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Create intelligent virtual assistants for customer service across every industry
A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification
Shutterstock Generative 3D service for 360 HDRi generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries.
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI model trained on Japanese language that understands regional nuances.
Enhance speech by correcting common audio degradations to create studio quality speech output.
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Cutting-edge vision-language model exceling in high-quality reasoning from images.
Cutting-edge vision-Language model exceling in high-quality reasoning from images.
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
Unique language model that delivers an unmatched accuracy-efficiency performance.
Predicts the 3D structure of a protein from its amino acid sequence.
Generates consistent characters across a series of images without requiring additional training.
Ingest and extract highly accurate insights contained in text, graphs, charts, and tables within massive volumes of PDF documents.
Predicts the 3D structure of a protein from its amino acid sequence.
Sovereign AI model finetuned on Traditional Mandarin and English data using the Llama-3 architecture.
Sovereign AI model trained on Japanese language that understands regional nuances.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.
Advanced LLM based on Mixture of Experts architecure to deliver compute efficient content generation
Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
Grounding dino is an open vocabulary zero-shot object detection model.
Enable smooth global interactions in 36 languages.
State-of-the-art accuracy and speed for English transcriptions.
Vision foundation model capable of performing diverse computer vision and vision language tasks.
Shutterstock Generative 3D service for 3D asset generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries
Getty Images’ API service for 4K image generation. Trained on NVIDIA Edify using Getty Images' commercially safe creative libraries.
Estimate gaze angles of a person in a video and redirect to make it frontal.
Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.
Model for writing and interacting with code across a wide range of programming languages and tasks.
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
Most advanced language model for reasoning, code, multilingual tasks; runs on a single GPU.
Cutting-edge lightweight open language model exceling in high-quality reasoning.
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
Advanced LLM to generate high-quality, context-aware responses for chatbots and search engines.
Grades responses on five attributes helpfulness, correctness, coherence, complexity and verbosity.
Creates diverse synthetic data that mimics the characteristics of real-world data.
Leading LLM for accurate, contextually relevant responses in the medical domain.
Leading LLM for accurate, contextually relevant responses in the medical domain.
Generates high-quality numerical embeddings from text inputs.
Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
A generative model of protein backbones for protein binder design.
Cutting-edge lightweight open language model exceling in high-quality reasoning.
Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
Cutting-edge lightweight open language model exceling in high-quality reasoning.
Cutting-edge open multimodal model exceling in high-quality reasoning from images.
LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
A general-purpose LLM with state-of-the-art performance in language understanding, coding, and RAG.
Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
An MOE LLM that follows instructions, completes requests, and generates creative text.
Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
Cutting-edge model built on Google's Gemma-7B specialized for code generation and code completion.
GPU-accelerated generation of text embeddings used for question-answering retrieval.
LLM capable of generating code from natural language and vice versa.
Run Google's DeepVariant optimized for GPU. Switch models for high accuracy on all major sequencers.
A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation
An MOE LLM that follows instructions, completes requests, and generates creative text.