NVIDIA

Copyright © 2026 NVIDIA Corporation

15 results for

Filters

  • Download Available: 13
  • Image Generation: 3
  • Text-to-Image: 3
  • Code Generation: 1
  • Object Detection: 1
  • Retrieval Augmented Generation: 1
  • NVIDIA: 8
  • Black forest labs: 3
  • Baidu: 1
  • DeepSeek AI: 1
  • Meta: 1

    Black-forest-labs
    Downloadable

    FLUX.1-dev

    FLUX.1 is a state-of-the-art suite of image generation models.
    Model
    Text-to-Image
    109K
    9mo
    Black-forest-labs
    Downloadable

    FLUX.1-Kontext-dev

    FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
    Model
    Text-to-Image
    4.17K
    7mo
    Black-forest-labs
    Downloadable

    FLUX.1-schnell

    FLUX.1-schnell is a distilled image generation model that produces high-quality images at fast speeds.
    Model
    Text-to-Image
    49.25K
    9mo
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    3.06M
    7mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    5M
    8mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    chat
    5.64M
    8mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    251K
    12mo
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    15.36K
    8mo
    NVIDIA
    Downloadable

    nvclip

    NV-CLIP is a multimodal embeddings model for image and text.
    Model
    Computer vision
    23.65K
    9mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    8.48K
    9mo
    NVIDIA
    Downloadable

    studiovoice

    Enhance speech by correcting common audio degradations to create studio quality speech output.
    Model
    Nvidia Maxine
    637
    9mo
    Microsoft
    Downloadable

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    5.75K
    6mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    188K
    8mo
    DGX Spark
    2 HRS

    Optimized JAX

    Optimize JAX to run on Spark
    Playbook
    DGX
    5mo
    DGX Spark
    1 HR

    NVFP4 Quantization

    Quantize a model to NVFP4 to run on Spark using TensorRT Model Optimizer
    Playbook
    DGX
    5mo