
Copyright © 2026 NVIDIA Corporation

8 results for

Filters

  • Download Available (1)
  • Medical Imaging (1)
  • NVIDIA (8)
  • AI Engineer (7)
  • ML Engineer (7)
  • Developer (6)
  • Application Developer (5)
  • Data Scientist (2)
  • AI and Machine Learning (7)
  • TAO Toolkit (7)
    NVIDIA
    Downloadable

    vista-3d

    VISTA-3D is a specialized interactive foundation model for segmenting and annotating human anatomies.
    Model
    Interactive Annotation
    824
    1y

    MAL (Mask Auto-Label) for weakly-supervised segmentation. Produces segmentation masks from minimal annotations (point or box annotations) using a ViT-MAE backbone. Use when training, evaluating, or running inference for a TAO MAL model. Trigger phrases in
    Skill
    Developer
    648
    17d

    Mask Grounding DINO for grounded instance segmentation. Extends Grounding DINO with a mask-prediction head for open-set segmentation guided by text prompts. Use when training, evaluating, exporting, quantizing, or running inference for a TAO Mask-Groundin
    Skill
    AI Engineer
    647
    17d

    Mask2Former for universal image segmentation (panoptic, instance, and semantic). Transformer-based with masked attention for high-quality segmentation results. Use when training, evaluating, exporting, quantizing, or running inference for a TAO Mask2Forme
    Skill
    Developer
    647
    17d

    OneFormer for universal image segmentation. Unifies panoptic, instance, and semantic segmentation with a single architecture using task-conditioned queries. Use when training, evaluating, exporting, quantizing, or running inference for a TAO OneFormer mod
    Skill
    Developer
    646
    17d

    SegFormer for semantic segmentation. Lightweight transformer-based architecture with hierarchical feature extraction, efficient for real-time segmentation tasks. Use when training, evaluating, exporting, quantizing, or running inference for a TAO SegForme
    Skill
    Developer
    646
    17d

    NVPanoptix3D for panoptic 3D scene reconstruction from posed RGB images. Produces 3D panoptic segmentation (semantic, instance, and panoptic masks) with occupancy completion. Built on a VGGT backbone with a Mask2Former-style head and 3D frustum reconstruc
    Skill
    Developer
    648
    17d

    Visual ChangeNet for binary image classification and segmentation in AOI defect detection. Use when training, evaluating, exporting, or running inference for PCB defect detection or visual inspection, comparing image pairs for PASS/NO_PASS classification,
    Skill
    Developer
    650
    17d