Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    36
  • Partner Endpoint
    9
  • Download Available
    57
  • Speech-to-Text
    9
  • Retrieval Augmented Generation
    8
  • Synthetic Data Generation
    7
  • Text-to-Embedding
    4
  • Object Detection
    4
  • Deep Infra
    8
  • Bitdeer AI
    2
  • Lightning AI
    2
  • Together AI
    1
  • CoreWeave
    1
  • NVIDIA
    76
  • OpenAI
    1
  • Meta
    0
  • Mistral AI
    0
  • Qwen
    0
  • H100 80GB HBM3
    14
  • L40S
    13
  • A100 SXM4 80GB
    12
  • H100 NVL
    8
  • B200
    7
  • 77 models
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-ultra-550b-a55b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    Agent
    Today
    Items per page
    of 4 pages
    NVIDIA
    Free Endpoint

    nemotron-3.5-content-safety

    Multilingual, multimodal model for detecting unsafe and toxic content.
    llm safety
    248
    2d
    NVIDIA
    Free Endpoint

    cosmos3-nano

    Generates physics-aware videos from text prompts or an image prompt for physical AI development.
    autonomous vehicles
    762
    3d
    NVIDIA
    DownloadableFree Endpoint

    cosmos3-nano-reasoner

    Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
    B200
    719
    3d
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-nano-omni-30b-a3b-reasoning

    Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.
    Image-to-Text
    8.91M
    1mo
    NVIDIA
    Downloadable

    Relighting

    Re-illuminate people in video to match target lighting from a 360 HDRI environment map.
    HDRI
    304
    1mo
    NVIDIA
    Free Endpoint

    nemotron-3-content-safety

    Multilingual, multimodal model for detecting unsafe and toxic content.
    llm safety
    218K
    1mo
    NVIDIA
    DownloadableFree Endpoint

    synthetic-video-detector

    NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.
    broadcast
    91.52K
    1mo
    NVIDIA
    DownloadableFree Endpoint

    Active Speaker Detection

    Detect and track speaker identities across video frames.
    broadcast
    337
    1mo
    NVIDIA
    Downloadable

    LipSync

    Generative lip dubbing that syncs lips in a video to input audio.
    broadcast
    1mo
    NVIDIA
    DownloadableFree Endpoint

    ising-calibration-1-35b-a3b

    Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
    Quantum
    328K
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    73.2K
    2mo
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    English
    2.13K
    2mo
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Automatic Speech Recognition
    9.46K
    2mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    363K
    2mo
    NVIDIA
    DownloadableFree Endpoint

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    MoE
    57.73M
    2mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    410K
    2mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    154K
    3mo
    NVIDIA
    Downloadable

    nemotron-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    404K
    3mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    38.74K
    3mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    12.61M
    3mo
    NVIDIA
    Free Endpoint

    gliner-pii

    GLiNER PII detects Personally Identifiable Information in text.
    PII Detection
    256K
    3mo
    NVIDIA
    Free Endpoint

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    3mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-vl-1b-v2

    Multimodal question-answer retrieval representing user queries as text and documents as images.
    nemo retriever
    6.66M
    3mo