Try NVIDIA NIM APIs

Search Results

Searching for: Generative AI

Sorting by Most Recent

meta llama-guard-4-12b

Multi-modal model to classify safety for input prompts as well as output responses.

llm multimodal safety content safety guardrail content moderator meta

google gemma-3n-e4b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation speech recognition visual qa chat google

google gemma-3n-e2b-it

An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments

language generation speech recognition visual qa chat google

nvidia cosmos-transfer1-7b

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

synthetic data generation autonomous vehicles physical ai robotics video-to-world nvidia

nvidia llama-3.2-nemoretriever-1b-vlm-embed-v1

Multimodal question-answer retrieval representing user queries as text and documents as images.

nemo retriever embedding retrieval augmented generation text-to-embedding nvidia

nvidia Biomedical AI-Q Research Agent Blueprint

Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint

launchable agent blueprint blueprint retrieval-augmented generation llm nvidia

nvidia Refine AI Agents through Continuous Model Distillation with Data Flywheels

Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.

nim launchable data flywheel blueprint enterprise nemo microservices nvidia ai nvidia

wandb AI Observability for Data Flywheel

Streamline evaluation, monitoring, and optimization of AI data flywheel with Weights & Biases.

ai agents data flywheel wandb blueprint observability partner nvidia ai wandb

iguazio AI Orchestration for Data Flywheel

Orchestrate AI agents for data flywheel with MLRun and NVIDIA NeMo microservices.

orchestration ai agents data flywheel blueprint partner nvidia ai iguazio

nvidia Safety for Agentic AI

Improve safety, security, and privacy of AI systems at build, deploy and run stages.

security launchable blueprint safety privacy nemo guardrails open models nvidia ai nvidia

nvidia AI Agent for Telecom Network Configuration Planning

Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.

nim blueprint simulation telecommunications nvidia ai nvidia

deepseek-ai deepseek-r1-0528

Updated version of DeepSeek-R1 with enhanced reasoning, coding, math, and reduced hallucination.

coding chat math advanced reasoning deepseek-ai

speakleash bielik-11b-v2.3-instruct

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

polish sovereign ai chat chatbots summarization speakleash

qwen qwen3-235b-a22b

Advanced reasoning MOE model excelling at reasoning, multilingual tasks, and instruction following

chat complex math advanced reasoning instruction following qwen

black-forest-labs FLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

image generation text-to-image run-on-rtx black-forest-labs

nvidia Build Digital Twins for AI Factory Design and Operations

Design, test, and optimize a new generation of intelligence manufacturing data centers using digital twins.

ai factory industrial nvidia omniverse blueprint simulation enterprise nvidia

utter-project eurollm-9b-instruct

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

sovereign ai chat chat text-to-text multilingual european regional language generation utter-project

gotocompany gemma-2-9b-cpt-sahabatai-instruct

SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.

sovereign ai chat indonesian chat text-to-text regional language generation gotocompany

mistralai mistral-small-3.1-24b-instruct-2503

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast responses

language generation multimodal image understanding mistralai

nvidia 3D Guided Generative AI

Create high quality images using Flux.1 in ComfyUI, guided by 3D.

blueprint run-on-rtx nvidia ai nvidia

nvidia Build an AI Agent for Enterprise Research

Build AI agents powered by reasoning models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.

nim launchable llama nemotron reasoning blueprint enterprise retrieval-augmented generation nvidia ai nemo retriever nvidia

nvidia AI Weather Analytics with Earth-2

Develop an AI-powered weather analysis and forecasting application that visualizes multi-layered geospatial data.

blueprint climate science enterprise weather simulation ai weather prediction nvidia ai earth-2 nvidia

siemens simcenter-star-ccm+

Run computational fluid dynamics (CFD) simulations

aerodynamics cae fluid-dynamics simulation heat-transfer computer-aided engineering siemens

cadence fidelity

Run computational fluid dynamics (CFD) simulations

aerodynamics cae fluid-dynamics simulation heat-transfer computer-aided engineering cadence

ansys fluent

Run computational fluid dynamics (CFD) simulations

aerodynamics cae fluid-dynamics simulation heat-transfer computer-aided engineering ansys

nvidia Synthetic Manipulation Motion Generation for Robotics

Generate exponentially large amounts of synthetic motion trajectories for robot manipulation from just a few human demonstrations.

nvidia omniverse blueprint synthetic data enterprise robotics physical ai robot learning humanoids nvidia isaac gr00t text-to-world image-to-world teleop nvidia

nvidia cosmos-predict1-7b

Generalist model to generate future world state as videos from text and image prompts to create synthetic training data for robots and autonomous vehicles.

synthetic data generation autonomous vehicles physical ai robotics text-to-world image-to-world nvidia

nvidia cosmos-predict1-5b

Generates future frames of a physics-aware world state from just an image or short video prompt for physical AI development.

synthetic data generation physical ai policy evaluation robotics video-to-world nvidia

nvidia Test Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

industrial nvidia omniverse blueprint simulation enterprise omniverse blueprint nvidia

nvidia sparsedrive

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

autonomous vehicles bev av stack automotive nvidia

nvidia llama-3.1-nemotron-nano-8b-v1

Reasoning and agentic AI model with leading accuracy for PC and edge.

math advanced reasoning instruction following function calling nvidia

deepseek-ai deepseek-r1-distill-llama-8b

Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

distillation coding chat reasoning run-on-rtx math deepseek-ai

nvidia LLM Router

Route LLM requests to the best model for the task at hand.

launchable blueprint llm router nvidia ai nvidia

deepseek-ai deepseek-r1-distill-qwen-32b

Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

deepseek-ai deepseek-r1-distill-qwen-14b

Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

deepseek-ai deepseek-r1-distill-qwen-7b

Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.

coding distillation chat reasoning math deepseek-ai

microsoft phi-4-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

code generation chat text-to-text language generation microsoft

nvidia Evo 2 Protein Design

This workflow shows how generative AI can generate DNA sequences that can be translated into proteins for bioengineering.

blueprint nim biology bionemo drug discovery protein generation nvidia

deepseek-ai deepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chat math advanced reasoning deepseek-ai

nvidia Build an Enterprise RAG pipeline

Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.

nim launchable blueprint enterprise retrieval-augmented generation nvidia ai nemo retriever nvidia

nvidia Build A Generative Protein Binder Design Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.

nvidia bionemo blueprint enterprise bionemo biology drug discovery protein generation nvidia

nvidia PDF to Podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

blueprint multi-modal launchable text-to-speech conversational ai pdf-to-podcast nvidia ai ai agent text-to-speech nvidia

wandb Traceability for Agentic AI

Trace and evaluate AI Agents with Weights & Biases.

traceability launchable ai agents wandb blueprint partner nvidia ai wandb

pipecat Voice Agent Framework for Conversational AI

Automate voice AI agents with NVIDIA NIM microservices and Pipecat.

pipecat launchable ai agents blueprint conversational ai partner nvidia ai pipecat

llamaindex Document Research Assistant for Blog Creation

Automate research and generate blogs with AI agents using LlamaIndex and the Llama3.3-70B NIM LLM.

blog creation launchable ai agents blueprint partner llamaindex nvidia ai llamaindex

langchain Structured Report Generation

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM

langgraph report generation launchable ai agents blueprint partner nvidia ai langchain

crewai Code Documentation for Software Development

Document your GitHub repositories with AI agents using CrewAI and Llama3.3 70B NIM.

code documentation crewai launchable ai agents blueprint partner nvidia ai crewai

university-at-buffalo cached

Context-aware chart extraction that can detect 18 classes of basic chart elements, excluding plot elements.

nemo retriever chart element detection image-to-text university-at-buffalo

baidu paddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

optical character recognition table extraction optical character detection nemo retriever data ingestion run-on-rtx extraction baidu

nvidia Build a Digital Twin for Interactive Fluid Simulation

This NVIDIA Omniverse™ Blueprint demonstrates how commercial software vendors can create interactive digital twins.

nvidia omniverse blueprint cae simulation external aerodynamics enterprise computer-aided-engineering nvidia

nvidia corrdiff

Generative downscaling model for generating high resolution regional scale weather fields.

ai weather prediction weather simulation earth-2 nvidia

nvidia fourcastnet

FourCastNet predicts global atmospheric dynamics of various weather / climate variables.

weather simulation ai weather prediction climate science earth-2 nvidia

hive deepfake-image-detection

Advanced AI model that detects faces and identifies deepfake images.

computer vision ai safety deep fake detection content moderation hive

nvidia Build a Video Search and Summarization (VSS) Agent

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A

vision video-to-text generative ai launchable blueprint chat enterprise nvidia ai nvidia

nvidia 3D Conditioning for Precise Visual Generative AI

Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset.

visual design nvidia omniverse blueprint simulation enterprise nvidia

nvidia Build an AI Virtual Assistant

Create intelligent virtual assistants for customer service across every industry

customer service launchable blueprint retrieval-augmented generation llm contact center nvidia ai nvidia

nvidia Vulnerability Analysis for Container Security

Rapidly identify and mitigate container security vulnerabilities with generative AI.

generative ai launchable nv-embedqa-e5-v5 blueprint llama-3_1-70b-instruct cybersecurity nvidia ai nvidia

institute-of-science-tokyo llama-3.1-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat regional language generation institute-of-science-tokyo

institute-of-science-tokyo llama-3.1-swallow-8b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

sovereign ai large language model chat chat regional language generation institute-of-science-tokyo

nvidia llama-3.1-nemotron-51b-instruct

Unique language model that delivers unmatched accuracy-efficiency performance.

chat language generation chat text-to-text nvidia

hive ai-generated-image-detection

Robust image classification model for detecting and managing AI-generated content.

image classification computer vision ai safety content moderation hive

yentinglin llama-3-taiwan-70b-instruct

Sovereign AI model fine-tuned on Traditional Chinese and English data using the Llama-3 architecture.

regional language generation chat code generation large language models yentinglin

tokyotech-llm llama-3-swallow-70b-instruct-v0.1

Sovereign AI model trained on Japanese language that understands regional nuances.

large language model chat regional language generation tokyotech-llm

nvidia Build A Generative Virtual Screening Pipeline

This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.

chemistry nim nvidia bionemo blueprint enterprise bionemo docking drug discovery nvidia

ai21labs jamba-1.5-mini-instruct

Cutting-edge MOE-based LLM designed to excel in a wide array of generative AI tasks.

chat chat language generation text-to-text ai21labs

ai21labs jamba-1.5-large-instruct

Cutting-edge MOE-based LLM designed to excel in a wide array of generative AI tasks.

chat chat language generation text-to-text ai21labs

microsoft phi-3.5-mini-instruct

Lightweight multilingual LLM powering AI applications in latency-bound, memory- and compute-constrained environments

code generation chat text-to-text language generation large language models microsoft

nvidia nv-grounding-dino

Grounding DINO is an open-vocabulary zero-shot object detection model.

object detection computer vision deepstream nvidia nim nvidia

briaai BRIA-2.3

An enterprise-grade text-to-image model, trained on a compliant dataset, that produces high-quality images.

image generation text-to-image briaai

google gemma-2-2b-it

Advanced small language generative AI model for edge applications

chat code generation chat text-to-text language generation google

nvidia usdsearch

AI-powered search for OpenUSD data, 3D models, images, and assets using text or image-based inputs.

openusd synthetic data generation digital twin usd text-to-3d nvidia nim nvidia

nvidia maisi

MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.

image generation medical imaging nvidia nim nvidia

01-ai yi-large

Powerful model trained on English and Chinese for diverse tasks including chatbot and creative writing.

chat code generation chat text-to-text multilingual 01-ai

nvidia retail-object-detection

EfficientDet-based object detection network to detect 100 specific retail objects from an input video.

object detection image cv vlm computer vision tao toolkit video nvidia nim nvidia

ipd rfdiffusion

A generative model of protein backbones for protein binder design.

biology nim bionemo drug discovery protein generation ipd

google paligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

image cv vision assistant vlm visual question answering computer vision language generation image-to-text video google

aisingapore sea-lion-7b-instruct

LLM to represent and serve the linguistic and cultural diversity of Southeast Asia

chat text-to-text regional language generation large language models aisingapore

mistralai mixtral-8x22b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai

stabilityai stable-video-diffusion

Stable Video Diffusion (SVD) is a generative diffusion model that leverages a single image as a conditioning frame to synthesize video sequences.

image generation text-to-image stabilityai

stabilityai sdxl-turbo

A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation

image generation text-to-image stabilityai

mistralai mixtral-8x7b-instruct-v0.1

An MOE LLM that follows instructions, completes requests, and generates creative text.

advanced reasoning chat code generation chat text-to-text large language models mistralai