NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • API Endpoint
    38
  • Download Available
    21
  • Code Generation
    22
  • Image-to-Text
    8
  • Text Translation
    2
  • Retrieval Augmented Generation
    0
  • Drug Discovery
    0
  • Microsoft
    9
  • Meta
    8
  • Mistral AI
    8
  • Google
    8
  • NVIDIA
    4
  • chat
  • 58 models
    Mistral AI

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    language generation
    5.92M
    3mo
    Mistral AI

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    language generation
    4.3M
    3mo
    NVIDIA

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    language generation
    1.49M
    4mo
    Speakleash

    bielik-11b-v2.6-instruct

    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Polish
    515K
    5mo
    Google

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    language generation
    684K
    7mo
    Google

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    language generation
    620K
    7mo
    Mistral AI

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    language generation
    695K
    9mo
    Utter-project

    eurollm-9b-instruct

    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Sovereign AI
    4.89K465K
    8mo
    Gotocompany

    gemma-2-9b-cpt-sahabatai-instruct

    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Sovereign AI
    465K
    8mo
    Mistral AI

    mistral-small-3.1-24b-instruct-2503

    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
    language generation
    1.41M
    9mo
    Mistral AI

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    language generation
    4.39M
    8mo
    Meta

    llama-4-maverick-17b-128e-instruct

    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    language generation
    3.01M
    7mo
    Meta

    llama-4-scout-17b-16e-instruct

    A multimodal, multilingual 16 MoE model with 17B parameters.
    language generation
    210K
    7mo
    Google

    gemma-3-27b-it

    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Vision Assistant
    5.48M
    9mo
    Google

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Translation
    4.47K501K
    9mo
    Microsoft

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    chat
    2.4M
    9mo
    Microsoft

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Speech Recognition
    449K
    9mo
    Tiiuae

    falcon3-7b-instruct

    Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
    Coding
    799K
    9mo
    Qwen

    qwen2.5-7b-instruct

    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    1.19M
    9mo
    Qwen

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    code completion
    5.27M
    8mo
    Qwen

    qwen2.5-coder-7b-instruct

    Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
    code completion
    546K
    9mo
    NVIDIA

    nemotron-4-mini-hindi-4b-instruct

    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Indic
    471K
    9mo
    Institute of Science Tokyo

    llama-3.1-swallow-70b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Sovereign AI
    461K
    9mo
    Institute of Science Tokyo

    llama-3.1-swallow-8b-instruct-v0.1

    Sovereign AI model trained on Japanese language that understands regional nuances.
    Sovereign AI
    472K
    9mo
    Items per page
    of 3 pages