Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

2 results for

Filters

  • NVIDIA
    2
  • AI Engineer
    2
  • Data Scientist
    2
  • Developer
    2
  • Ml Engineer
    2
  • AI And Machine Learning
    2
  • TAO Toolkit
    2
  • Two-step image grounding pipeline: extracts referring expressions from (image, caption) pairs and grounds them to pixel-space bounding boxes via a VLM. Use when the user wants to ground captions to bboxes, generate phrase-grounded annotations, auto-label
    Skill
    Developer
    456
    10d

    Four-step image referring-expression pipeline: turns images plus KITTI bounding-box labels into region descriptions, scene captions, grounded referring expressions, and (optionally) verified expressions via VLM distillation. Use when the user wants to gen
    Skill
    Developer
    455
    10d
    Items per page
    of 1 pages