NVIDIA
Explore
Models
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2025 NVIDIA Corporation

Search Results

Searching for: AST
Sorting by Most Recent

mistralaidevstral-2-123b-instruct-2512

State-of-the-art open code model with deep reasoning, 256k context, and unmatched efficiency.

codingchatreasoningtext-to-codeagentic

mistralaimistral-large-3-675b-instruct-2512

A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.

language generationchatImage-to-Textmultimodalagentic

nvidiaQuantitative Portfolio Optimization

Enable fast, scalable, and real-time portfolio optimization for financial institutions.

developer exampleLaunchableBlueprintcuoptportfolio optimizationalgorithmic tradingfinancial services

nvidiaAmbient Healthcare Agents

Build advanced AI agents for providers and patients using this developer example powered by NeMo Microservices, NVIDIA Nemotron, Riva ASR and TTS, and NVIDIA LLM NIM

agent blueprintblueprintnimLaunchablenemollmNVIDIA AI

cyborgCyborg Enterprise RAG

Securely extract, embed, and index multimodal data with encryption in-use for fast, accurate semantic search.

NIMLaunchableBlueprintRetrieval-Augmented GenerationNeMo Retriever

nvidiaparakeet-ctc-0.6b-zh-tw

Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.

ASRStreamingTaiwaneseSpeech-to-TextNVIDIA NIM

speakleashbielik-11b-v2.6-instruct

State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.

PolishSovereign AIchatChatbotsSummarization

nvidiaparakeet-ctc-0.6b-zh-cn

Record-setting accuracy and performance for Mandarin English transcriptions.

ASRStreamingSpeech-to-TextMandarinNVIDIA NIM

nvidiaparakeet-ctc-0.6b-es

Accurate and optimized Spanish English transcriptions with punctuation and word timestamps.

ASRStreamingSpeech-to-TextSpanishNVIDIA NIM

nvidiaparakeet-ctc-0.6b-vi

Accurate and optimized Vietnamese-English transcriptions with punctuation and word timestamps.

ASRStreamingSpeech-to-TextVietnameseNVIDIA NIM

deepseek-aideepseek-v3.1

DeepSeek V3.1 Instruct is a hybrid AI model with fast reasoning, 128K context, and strong tool use.

ReasoningchatText-to-Text

nvidianemoretriever-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Optical Character RecognitionTable Extractionnemo retrieverdata ingestionextraction

nvidiaparakeet-tdt-0.6b-v2

Accurate and optimized English transcriptions with punctuation and word timestamps

ASREnglishNVIDIA NIMNVIDIA Rivaspeech-to-text

nvidianemoretriever-ocr

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Optical Character RecognitionTable Extractionnemo retrieverdata ingestionextraction

moonshotaikimi-k2-instruct

State-of-the-art open mixture-of-experts model with strong reasoning, coding, and agentic capabilities

codingchatadvanced reasoningagentic

metallama-guard-4-12b

Multi-modal model to classify safety for input prompts as well output responses.

LLM Multimodal SafetyContent SafetyGuardrailContent Moderator

nvidiallama-3.2-nemoretriever-1b-vlm-embed-v1

Multimodal question-answer retrieval representing user queries as text and documents as images.

nemo retrieverembeddingRetrieval Augmented GenerationText-to-Embedding

nvidiaSafety for Agentic AI

Improve safety, security, and privacy of AI systems at build, deploy and run stages.

securityLaunchableBlueprintsafetyprivacyNemo Guardrailsopen modelsNVIDIA AI

nvidiallama-3.1-nemotron-nano-4b-v1.1

State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents

edgetool callingchatreasoningmath

marinmarin-8b-instruct

State-of-the-art open model trained on open datasets, excelling in reasoning, math, and science.

ReasoningchatScienceOpen ModelMath

qwenqwen3-235b-a22b

Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following

chatcomplex mathadvanced reasoninginstruction following

black-forest-labsFLUX.1-schnell

FLUX.1-schnell is a distilled image generation model, producing high quality images at fast speeds

Image GenerationText-to-ImageRun-on-RTX

utter-projecteurollm-9b-instruct

State-of-the-art, multilingual model tailored to all 24 official European Union languages.

Sovereign AIchatText-to-TextMultilingualEuropeanRegional Language Generation

mistralaimistral-small-3.1-24b-instruct-2503

Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses

language generationchatmultimodalimage understanding

nvidiaparakeet-1.1b-rnnt-multilingual-asr

High accuracy and optimized performance for transcription in 25 languages

ASRStreamingSpeech-to-TextMultilingualNVIDIA NIM

black-forest-labsFLUX.1-dev

FLUX.1 is a state-of-the-art suite of image generation models

Image GenerationText-to-ImageRun-on-RTX

nvidiaBuild an AI Agent for Enterprise Research

Build a custom enterprise research assistant powered by state-of-the-art models that process and synthesize multimodal data, enabling reasoning, planning, and refinement to generate comprehensive reports.

NIMLaunchableLlama NemotronReasoningBlueprintEnterpriseRetrieval-Augmented GenerationNVIDIA AINeMo Retriever

nvidiaTest Multi-Robot Fleets for Industrial Automation

Simulate, test, and optimize physical AI and robotic fleets at scale in industrial digital twins before real-world deployment.

industrialNVIDIA OmniverseBlueprintsimulationEnterpriseomniverse blueprint

nvidiaLLM Router

Route LLM requests to the best model for the task at hand.

LaunchableBlueprintLLM RouterNVIDIA AI

openaiwhisper-large-v3

Robust Speech Recognition via Large-Scale Weak Supervision.

ASRASTSpeech-to-TextbatchwhisperOpenAIMultilingualNVIDIA NIMNVIDIA Riva

nvidiacanary-1b-asr

Multi-lingual model supporting speech-to-text recognition and translation.

Automatic Speech RecognitionAutomatic Speech TranslationNVIDIA NIMNVIDIA Riva

deepseek-aideepseek-r1

State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

chatMathadvanced reasoning

nvidiaBuild an Enterprise RAG Pipeline Blueprint

Power fast, accurate semantic search across multimodal enterprise data with NVIDIA’s RAG Blueprint—built on NeMo Retriever and Nemotron models—to connect your agents to trusted, authoritative sources of knowledge.

NIMLaunchableNemotronBlueprintEnterpriseRetrieval-Augmented GenerationNVIDIA AINeMo Retriever

nvidiausdcode

State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.

OpenUSDSynthetic Data GenerationDigital TwinchatCode Generation

university-at-buffalocached

Context-aware chart extraction that can detect 18 classes for chart basic elements, excluding plot elements.

nemo retrieverChart Element DetectionImage-To-Text

baidupaddleocr

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

Optical Character RecognitionTable ExtractionOptical Character Detectionnemo retrieverdata ingestionrun-on-rtxextraction

metallama-3.2-3b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chatCode GenerationText-to-TextLanguage Generation

metallama-3.2-1b-instruct

Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

chatCode GenerationText-to-TextLanguage Generation

nvidiamistral-nemo-minitron-8b-base

State-of-the-art small language model delivering superior accuracy for chatbot, virtual assistants, and content generation.

language generationtext-to-textchatsmall language model

rakutenrakutenai-7b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatText-to-TextLanguage GenerationLarge Language Models

rakutenrakutenai-7b-chat

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatText-to-TextLanguage GenerationLarge Language Models

nvidiaparakeet-ctc-1.1b-asr

Record-setting accuracy and performance for English transcription.

ASRStreamingEnglishSpeech-to-TextbatchNVIDIA NIM

nvidiaparakeet-ctc-0.6b-asr

State-of-the-art accuracy and speed for English transcriptions.

ASRStreamingEnglishBatchSpeech-to-TextFastNVIDIA NIMRun-on-RTX

metallama-3.1-8b-instruct

Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

chatCode GenerationText-to-TextLanguage GenerationRun-on-RTX

googlepaligemma

Vision language model adept at comprehending text and visual inputs to produce informative responses

imagecvVision AssistantvlmVisual Question Answeringcomputer visionLanguage GenerationImage-to-Textvideo

microsoftphi-3-mini-4k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatCode GenerationText-to-TextLanguage GenerationLarge Language Models

microsoftphi-3-mini-128k-instruct

Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.

chatCode GenerationText-to-TextLanguage GenerationLarge Language Models

metallama3-8b-instruct

Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.

chatCode GenerationText-to-TextLanguage GenerationLarge Language Models