Robust Speech Recognition via Large-Scale Weak Supervision.
Multi-lingual model supporting speech-to-text recognition and translation.
Multi-lingual model supporting speech-to-text recognition and translation.
Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.
Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots
State-of-the-art accuracy and speed for English transcriptions.
Novel recurrent architecture based language model for faster inference when generating long sequences.
A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation