NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Free Endpoint
    3
  • Partner Endpoint
    4
  • Download Available
    1
  • Code Generation
    0
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Image-to-Text
    0
  • Object Detection
    0
  • Deep Infra
    3
  • Fireworks AI
    3
  • GMI Cloud
    2
  • Bitdeer AI
    2
  • CoreWeave
    2
  • NVIDIA
    1
  • Qwen
    1
  • DeepSeek AI
    1
  • Moonshotai
    1
  • Meta
    0
  • Enterprise
    0
  • NVIDIA BioNemo
    0
  • long-context
  • 4 models
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    chat
    33.52M
    3w
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    chat
    15.8M
    3mo
    Moonshotai
    Free Endpoint

    kimi-k2-instruct-0905

    Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
    long-context
    14.28M
    6mo
    Qwen
    Free Endpoint

    qwen3-coder-480b-a35b-instruct

    Excels in agentic coding and browser use and supports 256K context, delivering top results.
    agentic coding
    3.59M
    7mo
    Items per page
    of 1 pages