NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    8
  • Partner Endpoint
    5
  • Download Available
    3
  • Image-to-Text
    2
  • Synthetic Data Generation
    2
  • Digital Twin
    2
  • Retrieval Augmented Generation
    1
  • Code Generation
    1
  • Deep Infra
    4
  • Together AI
    3
  • Bitdeer AI
    3
  • GMI Cloud
    1
  • CoreWeave
    1
  • NVIDIA
    4
  • Mistral AI
    2
  • Z.ai
    2
  • Qwen
    1
  • Moonshotai
    1
  • A100 SXM4 80GB
    0
  • B200
    0
  • GB200
    0
  • GH200 144G HBM3e
    0
  • H100 80GB HBM3
    0
  • 11 models
    Z.ai
    Downloadable

    glm-5.1

    GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.
    Agentic AI
    Items per page
    of 1 pages
    2.53M
    1w
    Z.ai
    Free Endpoint

    glm-4.7

    GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.
    Tool Calling
    4.57M
    1w
    Moonshotai
    Free Endpoint

    kimi-k2-thinking

    Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
    Conversational
    3.37M
    4mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    language generation
    4.15M
    4mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    language generation
    1.6M
    4mo
    Qwen
    Free Endpoint

    qwen3-coder-480b-a35b-instruct

    Excels in agentic coding and browser use and supports 256K context, delivering top results.
    agentic coding
    3.27M
    7mo
    NVIDIA
    Free Endpoint

    usdcode

    State-of-the-art LLM that answers OpenUSD knowledge queries and generates USD-Python code.
    Digital Twin
    9mo
    NVIDIA
    Free Endpoint

    usdvalidate

    Verify compatibility of OpenUSD assets with instant RTX render and rule-based validation.
    Validation
    617
    1y
    NVIDIA
    Free Endpoint

    nv-embed-v1

    Generates high-quality numerical embeddings from text inputs.
    Non-Commercial Use Only
    3.67M
    9mo
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Non-Commercial Use Only
    192K
    1y
    NVIDIA
    Downloadable

    vista-3d

    VISTA-3D is a specialized interactive foundation model for segmenting and anotating human anatomies.
    Interactive Annotation
    757
    1y