NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

22 results for

Filters

  • Free Endpoint
    13
  • Partner Endpoint
    12
  • Download Available
    9
  • Image-to-Text
    8
  • Code Generation
    6
  • Deep Infra
    6
  • Together AI
    6
  • Bitdeer AI
    4
  • CoreWeave
    3
  • GMI Cloud
    2
  • Google
    5
  • Meta
    5
  • Mistral AI
    5
  • Microsoft
    2
  • NVIDIA
    2
  • Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    11mo
    Items per page
    of 1 pages
    420K
    Mistral AI
    Deprecation in 11dFree Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    Model
    language generation
    1.03M
    9mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    339K
    9mo
    Mistral AI
    Downloadable

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    language generation
    1.62M
    4mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    3.9M
    4mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    5.25M
    10mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    291K
    1y
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    206K
    1y
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    22.11K436K
    11mo
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    13.7K765K
    11mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    2.16M
    10mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    30.26K
    1y
    Google
    Deprecation in 12dFree Endpoint

    gemma-3-27b-it

    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Model
    Vision Assistant
    4.16M
    11mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    1.07M
    9mo
    Meta
    Free Endpoint

    llama-4-maverick-17b-128e-instruct

    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    Model
    language generation
    12.54M
    9mo
    Mistral AI
    Downloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    Chat
    551K
    10mo
    NVIDIA
    Downloadable

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    4.8M
    6mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    Chat
    532K
    11mo
    Microsoft
    Free Endpoint

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Model
    Speech Recognition
    399K
    11mo
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    13.74M
    9mo
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    397K
    11mo
    Qwen
    Downloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    Model
    code completion
    2.78M
    10mo