Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Connect AI applications to enterprise data using industry-leading embedding and reranking models for information retrieval at scale.
Topic control model to keep conversations focused on approved topics, avoiding inappropriate content.
Industry leading jailbreak classification model for protection from adversarial attempts
Leading content safety model for enhancing the safety and moderation capabilities of LLMs
NVIDIA DGX Cloud trained multilingual LLM designed for mission critical use cases in regulated industries including financial services, government, heavy industry
Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
Multilingual LLM with emphasis on European languages supporting regulated use cases including financial services, government, heavy industry
Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
Transform PDFs into AI podcasts for engaging on-the-go audio content.
Automate voice AI agents with NVIDIA NIM microservices and Pipecat.
Automate research, and generate blogs with AI Agents using LlamaIndex and Llama3.3-70B NIM LLM.
Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM
Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.
Multi-modal vision-language model that understands text/img/video and creates informative responses
Generates physics-aware video world states from text and image prompts for physical AI development.
Generates future frames of a physics-aware world state based on simply an image or short video prompt for physical AI development.
Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
SAM 2 is a segmentation model that enables fast, precise selection of any object in any video or image.
Powerful LLM designed for creative thinking and writing.
Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
Advanced LLM for reasoning, math, general knowledge, and function calling
Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
Create real-time digital twins by combining accelerated solvers, simulation AI, and virtual environments.
Generative downscaling model for generating high resolution regional scale weather fields.
FourCastNet predicts global atmospheric dynamics of various weather / climate variables.
Advanced AI model detects faces and identifies deep fake images.
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.
Efficiently refine retrieval results over multiple sources and languages.
World-class multilingual and cross-lingual question-answering retrieval.
Create intelligent virtual assistants for customer service across every industry
A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
Detects jailbreaking, bias, violence, profanity, sexual content, and unethical behavior
Advanced Small Language Model supporting RAG, summarization, classification, code, and agentic AI
Highly efficient Mixture of Experts model for RAG, summarization, entity extraction, and classification
Shutterstock Generative 3D service for 360 HDRi generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries.
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.
Efficient hybrid state-space model designed for conversational and reasoning tasks.
Rapidly identify and mitigate container security vulnerabilities with generative AI.
Sovereign AI model trained on Japanese language that understands regional nuances.
Sovereign AI model trained on Japanese language that understands regional nuances.
Enhance speech by correcting common audio degradations to create studio quality speech output.