FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds
Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
Robust Speech Recognition via Large-Scale Weak Supervision.
Multi-lingual model supporting speech-to-text recognition and translation.
Multi-lingual model supporting speech-to-text recognition and translation.
Continuously extract, embed, and index multimodal data for fast, accurate semantic search. Built on world-class NeMo Retriever models, the RAG blueprint connects AI applications to multimodal enterprise data wherever it resides.
This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
This blueprint shows how generative AI and accelerated NIM microservices can design optimized small molecules smarter and faster.
Expressive and engaging English voices for Q&A assistants, brand ambassadors, and service robots
State-of-the-art accuracy and speed for English transcriptions.
Novel recurrent architecture based language model for faster inference when generating long sequences.
A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation