Copyright © 2026 NVIDIA Corporation

142 results for

  • Baichuan AI

    baichuan2-13b-chat

    Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
    Model
    Chinese Language Generation
    473K
    9mo
    BAAI

    bge-m3

    Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.
    Model
    Embeddings
    2.07M
    10mo
    NVIDIA
    Launchable

    Biomedical AI-Q Research Agent Blueprint

    Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint
    Blueprint
    Launchable
    2w
    MediaTek

    breeze-7b-instruct

    LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
    Model
    chat
    475K
    9mo
    NVIDIA
    Enterprise

    Build A Generative Protein Binder Design Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
    Blueprint
    NVIDIA BioNemo
    2w
    NVIDIA
    Launchable
    Enterprise

    Build a Video Search and Summarization (VSS) Agent

    Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
    Blueprint
    vision
    2w
    NVIDIA
    Launchable
    Enterprise

    Build an AI Agent for Enterprise Research

    Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.
    Blueprint
    NIM
    3w
    NVIDIA
    Launchable

    Build an AI Virtual Assistant

    Create intelligent virtual assistants for customer service across every industry
    Blueprint
    Customer Service
    2w
    NVIDIA
    Launchable
    Enterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NIM
    2w
    University at Buffalo

    cached

    Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.
    Model
    nemo retriever
    738
    1y
    NVIDIA

    canary-1b-asr

    Multi-lingual model supporting speech-to-text recognition and translation.
    Model
    Automatic Speech Recognition
    1.58K
    11mo
    THUDM

    chatglm3-6b

    Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.
    Model
    Text Translation
    511K
    7mo
    NVIDIA

    cosmos-nemotron-34b

    Multi-modal vision-language model that understands text, images, and video and creates informative responses.
    Model
    VLM
    6
    1y
    NVIDIA

    cosmos-reason1-7b

    Reasoning vision language model (VLM) for physical AI and robotics.
    Model
    video understanding
    15.93K
    6mo
    NVIDIA

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    Model
    video understanding
    194K
    2mo
    Cyborg
    Launchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    2w
    DeepSeek AI

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    4.14M
    8mo
    DeepSeek AI

    deepseek-r1-distill-qwen-14b

    Distilled version of Qwen 2.5 14B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.07K 3.78M
    9mo
    DeepSeek AI

    deepseek-r1-distill-qwen-32b

    Distilled version of Qwen 2.5 32B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.44K 4.17M
    9mo
    DeepSeek AI

    deepseek-r1-distill-qwen-7b

    Distilled version of Qwen 2.5 7B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    coding
    2.21K 4.11M
    9mo
    DeepSeek AI

    deepseek-v3.1

    DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.
    Model
    Reasoning
    14.26M
    6mo
    DeepSeek AI

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    Model
    long context
    14.82M
    2mo
    Mistral AI

    devstral-2-123b-instruct-2512

    State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.
    Model
    coding
    5.09M
    2mo
    Abacus.AI

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    chat
    534K
    9mo