NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

12 results for

Filters

  • Partner Endpoint
    2
  • Download Available
    5
  • Enterprise Blueprint
    3
  • Launchable
    3
  • Developer Example
    1
  • Optical Character Recognition
    3
  • Drug Discovery
    2
  • Speech-to-Text
    1
  • Deep Infra
    2
  • GMI Cloud
    1
  • NVIDIA
    9
  • Cyborg
    1
  • DeepSeek AI
    1
  • OpenAI
    1
  • NVIDIA BioNemo
    2
  • NVIDIA AI
    1
  • DeepSeek AI
    Downloadable

    deepseek-v4-flash

    DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.
    Model
    coding
    Items per page
    of 1 pages
    11.27M
    3w
    Cyborg
    Launchable

    Cyborg Enterprise RAG

    Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.
    Blueprint
    NIM
    3mo
    NVIDIA
    Downloadable

    nemoretriever-ocr

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    13.46K
    10mo
    NVIDIA
    Downloadable

    nemoretriever-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    2.07M
    9mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    310K
    2mo
    Financial Services
    LaunchableDeveloper Example

    Quantitative Portfolio Optimization

    Enable fast, scalable, and real-time portfolio optimization for financial institutions.
    Blueprint
    developer example
    3mo
    DGX Spark
    30 MIN

    Speculative Decoding

    Learn how to set up speculative decoding for fast inference on Spark
    Playbook
    DGX
    7mo
    OpenAI
    Downloadable

    whisper-large-v3

    Robust Speech Recognition via Large-Scale Weak Supervision.
    Model
    ASR
    58.49K
    1y
    DGX Station
    30 MINS

    Local Coding Agent

    Run local CLI coding agents with Ollama on DGX Station (NVIDIA GB300) using glm-4.7-flash (fast) or unsloth/GLM-4.7-GGUF:Q8_0 (best quality)
    Playbook
    Coding
    1mo
    General
    LaunchableEnterprise

    Build an Enterprise RAG Pipeline Blueprint

    Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.
    Blueprint
    NVIDIA AI
    3mo
    Drug Discovery
    Enterprise

    Build A Generative Protein Binder Design Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
    Blueprint
    NVIDIA BioNemo
    3mo
    Drug Discovery
    Enterprise

    Build A Generative Virtual Screening Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.
    Blueprint
    Chemistry
    3mo