nvidia/nvclip

NV-CLIP is a multimodal embeddings model for image and text.
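As a rough illustration of how embeddings from a CLIP-style model are typically used, the sketch below ranks a few images against a piece of text by cosine similarity. It does not call NV-CLIP itself: the vectors are made-up four-dimensional placeholders standing in for real model output, and the helper names `cosine_similarity` and `rank_images` and the file names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_images(text_embedding, image_embeddings):
    """Return (image_name, score) pairs, most similar to the text first."""
    scores = {
        name: cosine_similarity(text_embedding, embedding)
        for name, embedding in image_embeddings.items()
    }
    return sorted(scores.items(), key=lambda pair: pair[1], reverse=True)


# Placeholder vectors (assumption): real embeddings would come from the model
# and have many more dimensions.
image_embeddings = {
    "cat.jpg": [0.9, 0.1, 0.0, 0.2],
    "car.jpg": [0.1, 0.8, 0.3, 0.0],
    "beach.jpg": [0.0, 0.2, 0.9, 0.1],
}
text_embedding = [0.8, 0.2, 0.1, 0.1]

ranking = rank_images(text_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With these placeholder values the text vector points in nearly the same direction as the first image vector, so that image ranks first. Real scores depend entirely on the embeddings the model returns.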

Input

Upload one to three images, then enter text in the textbox that is relevant to one of them.

Output