NVIDIA

Copyright © 2026 NVIDIA Corporation

15 results for

Filters

  • Download Available (13)
  • Image Generation (3)
  • Text-to-Image (3)
  • Code Generation (1)
  • Object Detection (1)
  • Retrieval Augmented Generation (1)
  • NVIDIA (8)
  • Black forest labs (3)
  • Baidu (1)
  • DeepSeek AI (1)
  • Meta (1)

    Black-forest-labs

    FLUX.1-dev

    FLUX.1 is a state-of-the-art suite of image generation models.
    Model
    Image Generation
    75.68K
    8mo
    Black-forest-labs

    FLUX.1-Kontext-dev

    FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
    Model
    Image Generation
    3.97K
    6mo
    Black-forest-labs

    FLUX.1-schnell

    FLUX.1-schnell is a distilled image generation model that produces high-quality images quickly.
    Model
    Image Generation
    40.63K
    8mo
    NVIDIA

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    3.26M
    7mo
    DeepSeek AI

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Model
    Distillation
    3.94M
    7mo
    Meta

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    chat
    4.15M
    7mo
    NVIDIA

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    351K
    11mo
    NVIDIA

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    16.69K
    7mo
    NVIDIA

    nvclip

    NV-CLIP is a multimodal embeddings model for image and text.
    Model
    Computer vision
    37.87K
    8mo
    NVIDIA

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    Model
    ASR
    7.92K
    8mo
    NVIDIA

    studiovoice

    Enhances speech by correcting common audio degradations to produce studio-quality output.
    Model
    Nvidia Maxine
    616
    8mo
    Microsoft

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    5.25K
    6mo
    Baidu

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Model
    Optical Character Recognition
    234K
    7mo
    DGX Spark
    2 HRS

    Optimized JAX

    Optimize JAX to run on Spark
    Playbook
    DGX
    5mo
    DGX Spark
    1 HR

    NVFP4 Quantization

    Quantize a model to NVFP4 to run on Spark using TensorRT Model Optimizer
    Playbook
    DGX
    5mo