Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
Publisher
Use Case
NIM Type
Sorting by Most Recent

Multi-modal vision-language model that understands text/img/video and creates informative responses

Generates physics-aware video world states from text and image prompts for physical AI development.

Chart Extraction. This is a context aware chart element detection model that can detect 18 classes for chart basic elements, excluding plot elements.

Shutterstock Generative 3D service for 360 HDRi generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries.

Multi-modal vision-language model that understands text/img/video and creates informative responses

Shutterstock Generative 3D service for 3D asset generation. Trained on NVIDIA Edify using Shutterstock’s licensed creative libraries

One-shot visual language understanding model that translates images of plots into tables.