Try NVIDIA NIM APIs

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Publisher

Use Case

NIM Type

Sorting by Most Recent

Multi-modal model to classify safety for input prompts as well output responses.

Multi-modal vision-language model that understands text/img and creates informative responses

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

Multi-lingual model supporting speech-to-text recognition and translation.

Multi-lingual model supporting speech-to-text recognition and translation.

Multi-modal vision-language model that understands text/img/video and creates informative responses

Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

Multi-modal vision-language model that understands text/img/video and creates informative responses

Embedding model for text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval.

Multi-modal vision-language model that understands text/images and generates informative responses

Multi-modal model for a wide range of tasks, including image understanding and language generation.