NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • Free Endpoint
    11
  • Partner Endpoint
    6
  • Download Available
    7
  • Code Generation
    5
  • Image-to-Text
    4
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Object Detection
    0
  • Deep Infra
    4
  • Together AI
    2
  • Bitdeer AI
    1
  • CoreWeave
    1
  • GMI Cloud
    0
  • Mistral AI
    4
  • Microsoft
    4
  • NVIDIA
    3
  • Qwen
    2
  • Rakuten
    2
  • language generation
  • 18 models
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    language generation
    4mo
    Items per page
    of 1 pages
    5.02M
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    language generation
    1.68M
    4mo
    NVIDIA
    Downloadable

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    language generation
    4.68M
    5mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Chat
    374K
    11mo
    Qwen
    DeprecatedDownloadable

    qwen2.5-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    5.17M
    11mo
    NVIDIA
    DeprecatedFree Endpoint

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Indic
    334K
    11mo
    Qwen
    DeprecatedFree Endpoint

    qwen2-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    115K
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Chat
    214K
    1y
    Microsoft
    DeprecatedFree Endpoint

    phi-3.5-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Chat
    830K
    1y
    Rakuten
    DeprecatedFree Endpoint

    rakutenai-7b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    37.83K
    11mo
    Rakuten
    DeprecatedFree Endpoint

    rakutenai-7b-chat

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    37.8K
    11mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Chat
    911K
    10mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    image
    38.52K
    1y
    AI Singapore
    DeprecatedFree Endpoint

    sea-lion-7b-instruct

    LLM to represent and serve the linguistic and cultural diversity of Southeast Asia
    Chat
    1y
    Microsoft
    DeprecatedDownloadable

    phi-3-mini-4k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Chat
    68.06K
    11mo
    Microsoft
    DeprecatedFree Endpoint

    phi-3-mini-128k-instruct

    Lightweight, state-of-the-art open LLM with strong math and logical reasoning skills.
    Chat
    91.01K
    11mo
    Meta
    DeprecatedDownloadable

    llama3-8b-instruct

    Advanced state-of-the-art LLM with language understanding, superior reasoning, and text generation.
    Chat
    599K
    11mo
    Mistral AI
    DeprecatedFree Endpoint

    mistral-7b-instruct-v0.2

    This LLM follows instructions, completes requests, and generates creative text.
    Text-to-Text
    75.24K
    11mo