NVIDIA

Copyright © 2026 NVIDIA Corporation

130 results

    Baichuan AI

    baichuan2-13b-chat

    Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
    Model
    Chinese Language Generation
    473K
    9mo
    BAAI

    bge-m3

    Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.
    Model
    Embeddings
    2.07M
    10mo
    NVIDIA
    Launchable

    Biomedical AI-Q Research Agent Blueprint

    Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
    Blueprint
    Launchable
    2w
    MediaTek

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    475K
    9mo
    NVIDIA
    Enterprise

    Build A Generative Protein Binder Design Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
    Blueprint
    NVIDIA BioNeMo
    2w
    NVIDIA
    Launchable
    Enterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    vision
    2w
    NVIDIA
    Launchable
    Enterprise

    Build an AI Agent for Enterprise Research

    Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.
    Blueprint
    NIM
    3w
    NVIDIA
    Launchable

    Build an AI Virtual Assistant

    Create intelligent virtual assistants for customer service across every industry
    Blueprint
    Customer Service
    2w
    NVIDIA
    Launchable
    Enterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NIM
    2w
    THUDM

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    Text Translation
    511K
    7mo
    NVIDIA

    cosmos-nemotron-34b

    Multimodal vision-language model that understands text, images, and video and generates informative responses.
    Model
    VLM
    6
    1y
    NVIDIA

    cosmos-predict1-5b

    Generates future frames of a physics-aware world state from just an image or short video prompt, for physical AI development.
    Model
    Synthetic Data Generation
    22.25K
    11mo
    NVIDIA

    cosmos-reason1-7b

    Reasoning vision language model (VLM) for physical AI and robotics.
    Model
    video understanding
    15.93K
    6mo
    NVIDIA

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    Model
    video understanding
    194K
    2mo
    NVIDIA

    cosmos-transfer1-7b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    15.87K
    8mo
    NVIDIA

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Model
    Synthetic Data Generation
    1w
    Cyborg
    Launchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    2w
    DeepSeek AI

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    4.14M
    8mo
    DeepSeek AI

    deepseek-r1-distill-qwen-14b

    Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.07K
    3.78M
    9mo
    DeepSeek AI

    deepseek-r1-distill-qwen-32b

    Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.44K
    4.17M
    9mo
    DeepSeek AI

    deepseek-r1-distill-qwen-7b

    Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.21K
    4.11M
    9mo
    DeepSeek AI

    deepseek-v3.1-terminus

    DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.
    Model
    tool calling
    12.1M
    5mo
    Mistral AI

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    5.09M
    2mo
    Abacus.AI

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    chat
    534K
    9mo