Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications

Multi-lingual model supporting speech-to-text recognition and translation.

Supports Chinese and English languages to handle tasks including chatbot, content generation, coding, and translation.