NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    33
  • Partner Endpoint
    9
  • Download Available
    62
  • Retrieval Augmented Generation
    13
  • Object Detection
    9
  • Text-to-Embedding
    8
  • Speech-to-Text
    8
  • Synthetic Data Generation
    7
  • Deep Infra
    6
  • Together AI
    3
  • Fireworks AI
    2
  • Bitdeer AI
    2
  • CoreWeave
    1
  • NVIDIA
    93
  • Mistral AI
    1
  • OpenAI
    1
  • Igenius
    1
  • Meta
    0
  • Enterprise
    1
  • NVIDIA BioNemo
    1
  • 96 models
    NVIDIA
    Enterprise

    Build A Generative Protein Binder Design Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
    generative-virtual-screening-for-drug-discovery
    2.26K
    5d
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    English
    3.72K
    1w
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Automatic Speech Recognition
    1.43K
    2w
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    46.36K
    2w
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    chat
    20.83M
    2w
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    12.18K
    3w
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    12.28K
    3w
    NVIDIA
    Downloadable

    nemotron-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    29.65K
    3w
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    10.45K
    3w
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    806K
    3w
    NVIDIA
    Free Endpoint

    gliner-pii

    GLiNER PII detects Personally Identifiable Information in text.
    PII Detection
    167K
    3w
    NVIDIA
    Free Endpoint

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    3.64M
    1mo
    NVIDIA
    Free Endpoint

    nemotron-content-safety-reasoning-4b

    A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
    NeMo Guardrails
    558K
    2mo
    NVIDIA
    Downloadable

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    video understanding
    164K
    3mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    466K
    3mo
    NVIDIA
    Downloadable

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    chat
    13.61M
    3mo
    NVIDIA
    Free Endpoint

    riva-translate-4b-instruct-v1_1

    Translation model in 12 languages with few-shots example prompts capability.
    nvidia nim
    531K
    3mo
    NVIDIA
    Free Endpoint

    streampetr

    StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.
    autonomous vehicles
    258K
    4mo
    NVIDIA
    Downloadable

    nemotron-parse

    Cutting-edge vision-language model exceling in retrieving text and metadata from images.
    text and table extraction
    520K
    5mo
    NVIDIA
    Downloadable

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    chat
    1.22M
    5mo
    NVIDIA
    Free Endpoint

    llama-3.1-nemotron-safety-guard-8b-v3

    Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs
    content moderation
    582K
    5mo
    NVIDIA
    Downloadable

    parakeet-ctc-0.6b-zh-tw

    Record-setting accuracy and performance for Mandarin Taiwanese English transcriptions.
    ASR
    363
    5mo
    NVIDIA
    Downloadable

    llama-3_2-nemoretriever-300m-embed-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    6.01K
    5mo
    Items per page
    of 4 pages