Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

22 results for

Filters

  • Free Endpoint
    12
  • Partner Endpoint
    10
  • Download Available
    8
  • Developer Example
    1
  • Launchable
    1
  • Image-to-Text
    7
  • Code Generation
    6
  • Deep Infra
    5
  • Together AI
    4
  • CoreWeave
    3
  • Bitdeer AI
    2
  • GMI Cloud
    2
  • Meta
    5
  • Mistral AI
    5
  • Google
    4
  • NVIDIA
    3
  • Microsoft
    2
  • NVIDIA AI
    1
  • A100 PG509 200
    1
  • A100 SXM4 80GB
    1
  • A10G
    1
  • B200
    1
  • GB200
    1
  • Mistral AI

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    language generation
    2.96M
    6mo
    Items per page
    of 1 pages
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    768K
    1y
    Mistral AI
    DeprecatedFree Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    Model
    language generation
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    44.56M
    10mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    3.44M
    6mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    4.11M
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.68M
    1y
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    502K
    1y
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    42.64K349K
    1y
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    25.01K1.05M
    1y
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    31.46M
    10mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.75M
    11mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    10.56K
    1y
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    3.45M
    10mo
    Meta
    Free Endpoint

    llama-4-maverick-17b-128e-instruct

    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    Model
    language generation
    22.78M
    10mo
    NVIDIA
    Downloadable

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    2.23M
    7mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    Chat
    444K
    1y
    Microsoft
    Free Endpoint

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Model
    Speech Recognition
    358K
    1y
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    604K
    1y
    Mistral AI
    DeprecatedDownloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    B200
    63.21K
    11mo
    Qwen
    DeprecatedDownloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    Model
    code completion
    943K
    11mo
    Retail
    LaunchableDeveloper Example

    Multi-Agent Intelligent Warehouse

    An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.
    Blueprint
    NVIDIA AI
    3mo