Models
Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservicesSorting by Most Recent

nvidiacanary-1b-asr
Multi-lingual model supporting speech-to-text recognition and translation.

nvidiacanary-0.6b-turbo-asr
Multi-lingual model supporting speech-to-text recognition and translation.

nvidiacosmos-nemotron-34b
Multi-modal vision-language model that understands text/img/video and creates informative responses

abacusaidracarys-llama-3.1-70b-instruct
Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.

ibmgranite-34b-code-instruct
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.

ibmgranite-8b-code-instruct
Software programming LLM for code generation, completion, explanation, and multi-turn conversion.