
Open Mixture of Experts LLM (230B, 10B active) for reasoning, coding, and tool-use/agent workflows

Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM

DeepSeek-V3.1: hybrid inference LLM with Think/Non-Think modes, stronger agents, 128K context, strict function calling.

Build advanced AI agents within the biomedical domain using the AI-Q Blueprint and the BioNeMo Virtual Screening Blueprint

Build a data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.

Streamline evaluation, monitoring, and optimization of AI data flywheel with Weights & Biases.

Orchestrate AI agents for data flywheel with MLRun and NVIDIA NeMo microservices.

Automate and optimize the configuration of radio access network (RAN) parameters using agentic AI and a large language model (LLM)-driven framework.

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

Natural and expressive voices in multiple languages. For voice agents and brand ambassadors.

Latency-optimized language model excelling in code, math, general knowledge, and instruction-following.

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

Trace and evaluate AI Agents with Weights & Biases.

Automate voice AI agents with NVIDIA NIM microservices and Pipecat.

Generate detailed, structured reports on any topic using LangGraph and Llama3.3 70B NIM.

Document your github repositories with AI Agents using CrewAI and Llama3.3 70B NIM.

Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A