NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Download Available
    13
  • API Endpoint
    0
  • Image Generation
    3
  • Text-to-Image
    3
  • Code Generation
    1
  • Retrieval Augmented Generation
    1
  • Object Detection
    1
  • NVIDIA
    6
  • Black forest labs
    3
  • Meta
    1
  • Microsoft
    1
  • DeepSeek AI
    1
  • Run-on-RTX
  • 13 models
    Microsoft
    Downloadable

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    text-to-3d
    5.66K
    6mo
    Black-forest-labs
    Downloadable

    FLUX.1-Kontext-dev

    FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
    Text-to-Image
    4.17K
    7mo
    Black-forest-labs
    Downloadable

    FLUX.1-schnell

    FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
    Text-to-Image
    45.78K
    9mo
    Black-forest-labs
    Downloadable

    FLUX.1-dev

    FLUX.1 is a state-of-the-art suite of image generation models
    Text-to-Image
    108K
    9mo
    DeepSeek AI
    Downloadable

    deepseek-r1-distill-llama-8b

    Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.
    Distillation
    4.96M
    8mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v2

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    265K
    12mo
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    16.02K
    8mo
    Baidu
    Downloadable

    paddleocr

    Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.
    Optical Character Recognition
    190K
    8mo
    NVIDIA
    Downloadable

    studiovoice

    Enhance speech by correcting common audio degradations to create studio quality speech output.
    Nvidia Maxine
    640
    9mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-asr

    State-of-the-art accuracy and speed for English transcriptions.
    ASR
    8.31K
    9mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    chat
    5.57M
    8mo
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Embedding
    3.14M
    7mo
    NVIDIA
    Downloadable

    nvclip

    NV-CLIP is a multimodal embeddings model for image and text.
    Computer vision
    19.7K
    9mo
    Items per page
    of 1 pages