Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Free Endpoint
    3
  • Partner Endpoint
    2
  • Download Available
    2
  • Image-to-Text
    1
  • Drug Discovery
    0
  • Retrieval Augmented Generation
    0
  • Speech-to-Text
    0
  • Code Generation
    0
  • Deepinfra
    2
  • OpenRouter
    2
  • GMI Cloud
    2
  • Together AI
    1
  • Vultr
    1
  • Qwen
    2
  • Google
    1
  • NVIDIA
    0
  • Meta
    0
  • Mistral AI
    0
  • B200
    1
  • GB200
    1
  • H100 80GB HBM3
    0
  • L40S
    0
  • H200
    0
  • image
  • 3 models
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    tool calling
    Items per page
    of 1 pages
    10M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    MoE
    13M
    4mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    image
    10K
    1y