NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Free Endpoint
    11
  • Partner Endpoint
    6
  • Download Available
    6
  • Code Generation
    5
  • Image-to-Text
    3
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Object Detection
    0
  • Deep Infra
    4
  • Together AI
    2
  • Bitdeer AI
    1
  • CoreWeave
    1
  • GMI Cloud
    0
  • Mistral AI
    4
  • Microsoft
    4
  • NVIDIA
    2
  • Qwen
    2
  • Rakuten
    2
  • Enterprise
    0
  • NVIDIA BioNemo
    0
  • language generation
  • 17 models
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    language generation
    5.51M
    4mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    language generation
    1.53M
    4mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Chat
    318K
    10mo
    Qwen
    DeprecatedDownloadable

    qwen2.5-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    7.2M
    11mo
    NVIDIA
    DeprecatedFree Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Indic
    363K
    11mo
    Qwen
    DeprecatedFree Endpoint

    qwen2-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    145K
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Chat
    178K
    1y
    Microsoft
    DeprecatedFree Endpoint

    phi-3.5-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Chat
    1.79M
    1y
    Rakuten
    DeprecatedFree Endpoint

    rakutenai-7b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    45.49K
    11mo
    Rakuten
    DeprecatedFree Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    45.99K
    11mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Chat
    836K
    10mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    image
    43.45K
    1y
    AI Singapore
    DeprecatedFree Endpoint

    sea-lion-7b-instruct

    LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
    Chat
    1y
    Microsoft
    DeprecatedDownloadable

    phi-3-mini-4k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Chat
    78.61K
    11mo
    Microsoft
    DeprecatedFree Endpoint

    phi-3-mini-128k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Chat
    106K
    10mo
    Meta
    DeprecatedDownloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    761K
    11mo
    Mistral AI
    DeprecatedFree Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Text-to-Text
    155K
    11mo
    Items per page
    of 1 pages