NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    96
  • Partner Endpoint
    71
  • Download Available
    130
  • Code Generation
    29
  • Retrieval Augmented Generation
    15
  • Drug Discovery
    14
  • Image-to-Text
    13
  • Object Detection
    9
  • Deep Infra
    51
  • Together AI
    43
  • GMI Cloud
    24
  • Bitdeer AI
    18
  • CoreWeave
    12
  • NVIDIA
    97
  • Meta
    15
  • Mistral AI
    14
  • Microsoft
    12
  • Google
    11
  • Enterprise
    1
  • NVIDIA BioNemo
    1
  • 226 models
    NVIDIA
    Free Endpoint

    ising-calibration-1-35b-a3b

    Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
    Quantum
    Today
    Minimaxai
    Free Endpoint

    minimax-m2.7

    MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    coding
    567K
    3d
    NVIDIA
    DeprecatedFree Endpoint

    audio2face-3d-claire-notongue

    Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
    Digital Humans
    40
    5d
    NVIDIA
    DeprecatedFree Endpoint

    audio2face-3d-james-notongue

    Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
    Digital Humans
    5d
    NVIDIA
    DeprecatedFree Endpoint

    audio2face-3d-james

    Converts streamed audio to facial blendshapes for realtime lipsyncing and facial performances.
    Digital Humans
    223
    5d
    Google
    Downloadable

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    coding
    1.47M
    1w
    NVIDIA
    Downloadable

    llama-nemotron-rerank-vl-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    659
    2w
    NVIDIA
    Enterprise

    Build A Generative Protein Binder Design Pipeline

    This blueprint shows how generative AI and accelerated NIM microservices can design protein binders smarter and faster.
    NVIDIA BioNemo
    3.13K
    3w
    Mistral AI
    Downloadable

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    code generation
    7.15M
    4w
    NVIDIA
    Free Endpoint

    nemotron-voicechat

    Nemotron 3 Voicechat
    English
    6.7K
    4w
    NVIDIA
    Downloadable

    nemotron-asr-streaming

    Real-time speech recognition for English
    Automatic Speech Recognition
    18.81K
    1mo
    Black-forest-labs
    Downloadable

    flux.2-klein-4b

    FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lighting speed
    Text-to-Image
    97.95K
    1mo
    NVIDIA
    Downloadable

    nemotron-ocr-v1

    Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.
    Table Extraction
    1.18M
    1mo
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    MoE
    48.93M
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-rerank-1b-v2

    GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.
    nemo retriever
    111K
    1mo
    Qwen
    Downloadable

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    tool calling
    8.34M
    1mo
    NVIDIA
    Downloadable

    nemotron-table-structure-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.46K
    1mo
    NVIDIA
    Downloadable

    nemotron-page-elements-v3

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    43.96K
    1mo
    NVIDIA
    Downloadable

    nemotron-graphic-elements-v1

    Model for object detection, fine-tuned to detect charts, tables, and titles in documents.
    Object Detection
    18.25K
    1mo
    NVIDIA
    Downloadable

    llama-nemotron-embed-1b-v2

    Multilingual, cross-lingual embedding model for long-document QA retrieval, supporting 26 languages.
    Text-to-Embedding
    1.37M
    1mo
    NVIDIA
    Free Endpoint

    gliner-pii

    GLiNER PII detects Personally Identifiable Information in text.
    PII Detection
    57.11K
    1mo
    Minimaxai
    Downloadable

    minimax-m2.5

    MiniMax M2.5 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.
    reasoning
    11.5M
    1mo
    NVIDIA
    Free Endpoint

    cosmos-transfer2.5-2b

    Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
    Synthetic Data Generation
    1mo
    Qwen
    Downloadable

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    MoE
    14.37M
    1mo
    Items per page
    ...
    of 10 pages