Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

43 results for

Filters

  • Free Endpoint
    17
  • Partner Endpoint
    7
  • Download Available
    23
  • Launchable
    5
  • Developer Example
    4
  • Enterprise Blueprint
    2
  • Retrieval Augmented Generation
    4
  • Object Detection
    3
  • Image-to-Text
    2
  • Text-to-Embedding
    2
  • Optical Character Recognition
    1
  • Deepinfra
    5
  • Bitdeer
    3
  • Together AI
    3
  • Digital Ocean
    2
  • Lightning AI
    2
  • NVIDIA
    42
  • Mistral AI
    1
  • AI Engineer
    4
  • Developer
    4
  • Data Scientist
    2
  • Ml Engineer
    2
  • Application Developer
    1
  • NVIDIA AI
    6
  • AI And Machine Learning
    4
  • A100 SXM4 80GB
    3
  • A10G
    3
  • H100 80GB HBM3
    3
  • H100 NVL
    3
  • H200
    3
  • Nemotron
    3
  • Riva
    1
  • NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    Model
    English
    1.77K
    3mo

    Routes NVIDIA Nemotron Speech (Riva) NIM tasks — deploys, runs, and tests ASR, TTS, and NMT NIMs on build.nvidia.com or self-hosted.
    Skill
    Developer
    467
    14d
    Items per page
    of 2 pages

    Plan, configure, and chain repo-native Nemotron customization steps into single-step or multi-step pipelines: curation, translation, SFT/PEFT (AutoModel or Megatron-Bridge), pretraining/CPT, RL alignment (DPO/RLVR/GRPO/RLHF), BYOB/MCQ benchmarks, checkpoi
    Skill
    Developer
    535
    20d
    NVIDIA
    Downloadable

    nemotron-parse

    Cutting-edge vision-language model exceling in retrieving text and metadata from images.
    Model
    text and table extraction
    218K
    7mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    1.49M
    1y
    General
    Developer Example

    Nemotron Voice Agent

    Build Real-Time Voice Agents with NVIDIA Nemotron NIM.
    Blueprint
    Voice Agent
    3mo

    Generates BYO custom safety policies for NVIDIA Nemotron content-safety guardrails — Nemotron-Content-Safety-Reasoning-4B (text) and multimodal Nemotron-3-Content-Safety. Produces a Markdown policy, JSON taxonomy, and drop-in inference prompts. Maps rough
    Skill
    Developer
    405
    14d

    Use when planning, debugging, tuning, evaluating, exporting, or deploying public Nemotron `embed`/`rerank` retrieval recipes.
    Skill
    Developer
    508
    18d
    NVIDIA
    Free Endpoint

    nemotron-3-content-safety

    Multilingual, multimodal model for detecting unsafe and toxic content.
    Model
    llm safety
    230K
    2mo
    NVIDIA
    Free Endpoint

    nemotron-3.5-content-safety

    Multilingual, multimodal model for detecting unsafe and toxic content.
    Model
    llm safety
    337K
    16d
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Model
    Automatic Speech Recognition
    8.88K
    3mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    39.78K
    3mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.53M
    1y
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Model
    Table Extraction
    341K
    3mo
    NVIDIA
    Downloadable

    nemotron-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    433K
    3mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Model
    Object Detection
    157K
    3mo
    DGX Spark
    30 MIN

    Nemotron-3-Nano with llama.cpp

    Run Nemotron-3-Nano-30B model using llama.cpp on DGX Spark
    Playbook
    Nemotron
    6mo
    NVIDIA
    Free Endpoint

    nemotron-content-safety-reasoning-4b

    A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
    Model
    NeMo Guardrails
    145K
    4mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    2.47M
    7mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    Model
    MoE
    11.91M
    6mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    MoE
    60.41M
    3mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-ultra-550b-a55b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Model
    Agent
    7.73M
    14d
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Model
    Text-to-Embedding
    4.45M
    3mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    Model
    nemo retriever
    501K
    3mo