NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    33
  • Partner Endpoint
    10
  • Download Available
    66
  • Retrieval Augmented Generation
    14
  • Object Detection
    9
  • Speech-to-Text
    9
  • Text-to-Embedding
    8
  • Synthetic Data Generation
    7
  • Deep Infra
    7
  • Together AI
    3
  • Bitdeer AI
    2
  • CoreWeave
    1
  • Digital Ocean
    1
  • NVIDIA
    94
  • Mistral AI
    1
  • OpenAI
    1
  • Meta
    0
  • Microsoft
    0
  • 96 models
    NVIDIA
    Downloadable

    NVIDIA AI for Media Relighting

    Re-illuminate people in video to match target lighting from a 360 HDRI environment map.
    HDRI
    4d
    Items per page
    of 4 pages
    160
    NVIDIA
    Free Endpoint

    nemotron-3-content-safety

    Multilingual, multimodal model for detecting unsafe and toxic content.
    llm safety
    16.35K
    5d
    NVIDIA
    DownloadableFree Endpoint

    synthetic-video-detector

    NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.
    broadcast
    298
    5d
    NVIDIA
    DownloadableFree Endpoint

    Active Speaker Detection

    Detect and track speaker identities across video frames.
    localization
    77
    5d
    NVIDIA
    Downloadable

    LipSync

    Generative lip dubbing that syncs lips in a video to input audio.
    lipsync
    5d
    NVIDIA
    Downloadable

    ising-calibration-1-35b-a3b

    Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
    Quantum
    58.35K
    1w
    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    3.13K
    3w
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    English
    5.14K
    1mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Automatic Speech Recognition
    23.59K
    1mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    1.87M
    1mo
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    MoE
    49.92M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    179K
    1mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    19.78K
    1mo
    NVIDIA
    Downloadable

    nemotron-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    45.94K
    1mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.69K
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    1.95M
    1mo
    NVIDIA
    Free Endpoint

    gliner-pii

    GLiNER PII detects Personally Identifiable Information in text.
    PII Detection
    118K
    1mo
    NVIDIA
    Free Endpoint

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    9.71M
    2mo
    NVIDIA
    Free Endpoint

    nemotron-content-safety-reasoning-4b

    A context‑aware safety model that applies reasoning to enforce domain‑specific policies.
    NeMo Guardrails
    293K
    2mo
    NVIDIA
    Downloadable

    cosmos-reason2-8b

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    video understanding
    164K
    3mo
    NVIDIA
    Downloadable

    nemoretriever-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    61.78K
    4mo
    NVIDIA
    Downloadable

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    MoE
    12.12M
    4mo
    NVIDIA
    Free Endpoint

    riva-translate-4b-instruct-v1_1

    Translation model in 12 languages with few-shots example prompts capability.
    nvidia nim
    209K
    4mo