NVIDIA
Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA | Launch from Hugging Face (Beta)

Filters

  • Free Endpoint (4)
  • Partner Endpoint (2)
  • Download Available (1)
  • Text Translation (2)
  • Code Generation (1)
  • Retrieval Augmented Generation (0)
  • Drug Discovery (0)
  • Image-to-Text (0)
  • Fireworks AI (2)
  • Deep Infra (2)
  • Together AI (1)
  • GMI Cloud (0)
  • Bitdeer AI (0)
  • Qwen (2)
  • Baichuan AI (1)
  • MediaTek (1)
  • THUDM (1)
  • NVIDIA (0)

5 models

Qwen
Downloadable
qwen2.5-7b-instruct
Chinese and English LLM targeting language, coding, mathematics, reasoning, etc.
Chinese Language Generation
2.3M
10mo

Qwen
Free Endpoint
qwen2-7b-instruct
Chinese and English LLM targeting language, coding, mathematics, reasoning, etc.
Chinese Language Generation
739K
10mo

THUDM
Free Endpoint
chatglm3-6b
Supports Chinese and English for tasks including chatbots, content generation, coding, and translation.
chat
611K
8mo

Baichuan AI
Free Endpoint
baichuan2-13b-chat
Supports Chinese and English chat, coding, math, instruction following, and quiz solving.
Chinese Language Generation
587K
10mo

MediaTek
Free Endpoint
breeze-7b-instruct
LLM for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese.
chat
580K
10mo