NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (2)

  • Free Endpoint
    3
  • Partner Endpoint
    5
  • Download Available
    2
  • Code Generation
    0
  • Drug Discovery
    0
  • Retrieval Augmented Generation
    0
  • Image-to-Text
    0
  • Object Detection
    0
  • Deep Infra
    4
  • Fireworks AI
    3
  • GMI Cloud
    3
  • Together AI
    2
  • Bitdeer AI
    2
  • NVIDIA
    2
  • Moonshotai
    2
  • DeepSeek AI
    1
  • Meta
    0
  • Mistral AI
    0
  • Enterprise
    0
  • NVIDIA BioNemo
    0
  • reasoning
  • long-context
  • 5 models
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    chat
    16.67M
    2w
    DeepSeek AI
    Free Endpoint

    deepseek-v3.2

    State-of-the-art 685B reasoning LLM with sparse attention, long context, and integrated agentic tools.
    chat
    16.46M
    3mo
    NVIDIA
    Downloadable

    nemotron-3-nano-30b-a3b

    Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more
    chat
    13.41M
    3mo
    Moonshotai
    Free Endpoint

    kimi-k2-thinking

    Open reasoning model with 256K context window, native INT4 quantization and enhanced tool use.
    Conversational
    3.26M
    3mo
    Moonshotai
    Free Endpoint

    kimi-k2-instruct-0905

    Follow-on version of Kimi-K2-Instruct with longer context window and enhanced reasoning capabilities.
    long-context
    11.44M
    6mo
    Items per page
    of 1 pages