NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters

  • Free Endpoint
    1
  • Partner Endpoint
    3
  • Download Available
    3
  • Image Generation
    1
  • Text-to-Image
    1
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Image-to-Text
    0
  • Deep Infra
    2
  • Together AI
    1
  • GMI Cloud
    1
  • Bitdeer AI
    1
  • CoreWeave
    0
  • Mistral AI
    1
  • Qwen
    1
  • Black forest labs
    1
  • ByteDance
    1
  • NVIDIA
    0
  • A100 SXM4 80GB
    0
  • B200
    0
  • GB200
    0
  • GH200 144G HBM3e
    0
  • H100 80GB HBM3
    0
  • 4 models
    Mistral AI
    Downloadable

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    code generation
    Items per page
    of 1 pages
    9.25M
    1mo
    Qwen
    Downloadable

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    text-generation
    17.76M
    7mo
    ByteDance
    Free Endpoint

    seed-oss-36b-instruct

    ByteDance open-source LLM with long-context, reasoning, and agentic intelligence.
    thinking budget
    1.11M
    7mo
    Black-forest-labs
    Downloadable

    FLUX.1-Kontext-dev

    FLUX.1 Kontext is a multimodal model that enables in-context image generation and editing.
    Text-to-Image
    2.94K
    8mo