
Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM.

Record-setting accuracy and performance for Mandarin-Taiwanese-English transcriptions.

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

Record-setting accuracy and performance for Mandarin-English transcriptions.

Accurate and optimized Spanish-English transcriptions with punctuation and word timestamps.

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

Accurate and optimized English transcriptions with punctuation and word timestamps.

Multimodal model to classify safety for input prompts as well as output responses.

Multimodal question-answer retrieval representing user queries as text and documents as images.

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

High accuracy and optimized performance for transcription in 25 languages.

Robust Speech Recognition via Large-Scale Weak Supervision.

Multi-lingual model supporting speech-to-text recognition and translation.

Record-setting accuracy and performance for English transcription.

State-of-the-art accuracy and speed for English transcriptions.