NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    2
  • Partner Endpoint
    2
  • Download Available
    2
  • Code Generation
    1
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Image-to-Text
    0
  • Object Detection
    0
  • Deep Infra
    1
  • Fireworks AI
    1
  • Together AI
    1
  • Bitdeer AI
    1
  • CoreWeave
    1
  • NVIDIA
    2
  • Mistral AI
    1
  • AI21 Labs
    1
  • Meta
    0
  • Microsoft
    0
  • Enterprise
    0
  • NVIDIA BioNemo
    0
  • 4 models
    NVIDIA
    Downloadable

    nemotron-3-super-120b-a12b

    Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more
    chat
    29.29M
    2w
    NVIDIA
    Downloadable

    nvidia-nemotron-nano-9b-v2

    High‑efficiency LLM with hybrid Transformer‑Mamba design, excelling in reasoning and agentic tasks.
    chat
    509K
    7mo
    AI21 Labs
    Free Endpoint

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    chat
    323K
    10mo
    Mistral AI
    Free Endpoint

    mamba-codestral-7b-v0.1

    Model for writing and interacting with code across a wide range of programming languages and tasks.
    chat
    406K
    10mo
    Items per page
    of 1 pages