NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIALaunch from Hugging FaceBeta

Filters (1)

  • API Endpoint
    38
  • Download Available
    21
  • Code Generation
    22
  • Image-to-Text
    8
  • Text Translation
    2
  • Drug Discovery
    0
  • Retrieval Augmented Generation
    0
  • Microsoft
    9
  • Meta
    8
  • Mistral AI
    8
  • Google
    8
  • NVIDIA
    4
  • chat
  • 58 models
    Mistral AI
    mistral-large-3-675b-instruct-2512
    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    language generation
    3mo
    Mistral AI
    ministral-14b-instruct-2512
    A general purpose VLM ideal for chat and instruction based use cases
    language generation
    3mo
    NVIDIA
    nemotron-nano-12b-v2-vl
    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    language generation
    4mo
    Speakleash
    bielik-11b-v2.6-instruct
    State-of-the-art model for Polish language processing tasks such as text generation, Q&A, and chatbots.
    Polish
    5mo
    Google
    gemma-3n-e4b-it
    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    language generation
    7mo
    Google
    gemma-3n-e2b-it
    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    language generation
    7mo
    Mistral AI
    mistral-nemotron
    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    language generation
    8mo
    Utter-project
    eurollm-9b-instruct
    State-of-the-art, multilingual model tailored to all 24 official European Union languages.
    Sovereign AI
    8mo
    Gotocompany
    gemma-2-9b-cpt-sahabatai-instruct
    SOTA LLM pre-trained for instruction following and proficiency in Indonesian language and its dialects.
    Sovereign AI
    8mo
    Mistral AI
    mistral-small-3.1-24b-instruct-2503
    Efficient multimodal model excelling at multilingual tasks, image understanding, and fast-responses
    language generation
    9mo
    Mistral AI
    mistral-medium-3-instruct
    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    language generation
    7mo
    Meta
    llama-4-maverick-17b-128e-instruct
    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    language generation
    7mo
    Meta
    llama-4-scout-17b-16e-instruct
    A multimodal, multilingual 16 MoE model with 17B parameters.
    language generation
    7mo
    Google
    gemma-3-27b-it
    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Vision Assistant
    9mo
    Google
    gemma-3-1b-it
    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Translation
    9mo
    Microsoft
    phi-4-mini-instruct
    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    chat
    9mo
    Microsoft
    phi-4-multimodal-instruct
    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Speech Recognition
    9mo
    Tiiuae
    falcon3-7b-instruct
    Instruction tuned LLM achieving SoTA performance on reasoning, math and general knowledge capabilities
    Coding
    9mo
    Qwen
    qwen2.5-7b-instruct
    Chinese and English LLM targeting for language, coding, mathematics, reasoning, etc.
    Chinese Language Generation
    9mo
    Qwen
    qwen2.5-coder-32b-instruct
    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    code completion
    8mo
    Qwen
    qwen2.5-coder-7b-instruct
    Powerful mid-size code model with a 32K context length, excelling in coding in multiple languages.
    code completion
    9mo
    NVIDIA
    nemotron-4-mini-hindi-4b-instruct
    A bilingual Hindi-English SLM for on-device inference, tailored specifically for Hindi Language.
    Indic
    9mo
    Institute of Science Tokyo
    llama-3.1-swallow-70b-instruct-v0.1
    Sovereign AI model trained on Japanese language that understands regional nuances.
    Sovereign AI
    9mo
    Institute of Science Tokyo
    llama-3.1-swallow-8b-instruct-v0.1
    Sovereign AI model trained on Japanese language that understands regional nuances.
    Sovereign AI
    9mo
    Items per page
    of 3 pages