NVIDIA

Copyright © 2026 NVIDIA Corporation

18 results for

    Google
    Deprecation in 5d
    Free Endpoint

    paligemma

    Vision-language model adept at comprehending text and visual inputs to produce informative responses.
    Model
    image
    42.45K
    1y
    NVIDIA
    Deprecation in 5d
    Free Endpoint

    retail-object-detection

    EfficientDet-based object detection network to detect 100 specific retail objects from an input video.
    Model
    Object Detection
    501
    1y
    NVIDIA
    Deprecation in 5d
    Free Endpoint

    visual-changenet

    Visual ChangeNet detects pixel-level changes between two images and outputs a semantic change segmentation mask.
    Model
    image
    645
    1y
    NVIDIA
    Deprecation in 5d
    Free Endpoint

    ocdrnet

    OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition, respectively.
    Model
    Optical Character Recognition
    1.06K
    1y
    NVIDIA
    Deprecation in 5d
    Free Endpoint

    nv-dinov2

    NV-DINOv2 is a visual foundation model that generates vector embeddings for the input image.
    Model
    computer vision
    1.01M
    1y
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-cn

    Record-setting accuracy and performance for Mandarin-English transcriptions.
    Model
    ASR
    3.87K
    7mo
    NVIDIA
    Free Endpoint

    nv-embed-v1

    Generates high-quality numerical embeddings from text inputs.
    Model
    Non-Commercial Use Only
    4.1M
    8mo
    NVIDIA
    Free Endpoint

    nv-embedcode-7b-v1

    The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
    Model
    nemo retriever
    252K
    10mo
    NVIDIA
    Downloadable

    nv-embedqa-e5-v5

    English text embedding model for question-answering retrieval.
    Model
    Embedding
    10.97M
    8mo
    NVIDIA
    Deprecation in 5d
    Free Endpoint

    nv-grounding-dino

    Grounding DINO is an open-vocabulary, zero-shot object detection model.
    Model
    Object Detection
    5.41K
    1y
    NVIDIA
    Downloadable

    nv-yolox-page-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    2.03K
    9mo
    NVIDIA
    Downloadable

    llama-3.2-nv-embedqa-1b-v2

    Multilingual and cross-lingual text question-answering retrieval with long context support and optimized data storage efficiency.
    Model
    nemo retriever
    2.74M
    8mo
    NVIDIA
    Downloadable

    llama-3.2-nv-rerankqa-1b-v2

    Fine-tuned reranking model for multilingual, cross-lingual text question-answering retrieval, with long context support.
    Model
    nemo retriever
    182K
    8mo
    NVIDIA
    Free Endpoint

    sparsedrive

    End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.
    Model
    autonomous vehicles
    95
    8mo
    NVIDIA
    Free Endpoint

    streampetr

    StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.
    Model
    autonomous vehicles
    22.67K
    4mo
    NVIDIA

    Vulnerability Analysis for Container Security

    Rapidly identify and mitigate container security vulnerabilities with generative AI.
    Blueprint
    generative ai
    1mo
    NVIDIA
    Downloadable

    maisi

    MAISI is a pre-trained volumetric (3D) CT Latent Diffusion Generative Model.
    Model
    Image Generation
    1.02K
    1y
    NVIDIA
    Downloadable

    nvclip

    NV-CLIP is a multimodal embeddings model for image and text.
    Model
    Computer vision
    79.98K
    10mo