Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices
nvidiallama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

nvidiamagpie-tts-multilingual
Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

deepseek-aideepseek-r1-distill-llama-8b
Distilled version of Llama 3.1 8B using reasoning data generated by DeepSeek R1 for enhanced performance.

nvidianemoretriever-table-structure-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

nvidianemoretriever-graphic-elements-v1
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

nvidianemoretriever-page-elements-v2
Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

colabfoldmsa-search
Generates a multiple sequence alignment from a query sequence and a protein sequence database search.

nvidianemoretriever-parse
Cutting-edge vision-language model exceling in retrieving text and metadata from images.