Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

22 results for

Filters

  • Free Endpoint
    12
  • Partner Endpoint
    10
  • Download Available
    8
  • Developer Example
    1
  • Launchable
    1
  • Image-to-Text
    7
  • Code Generation
    6
  • Deep Infra
    5
  • CoreWeave
    3
  • Together AI
    3
  • Bitdeer AI
    2
  • GMI Cloud
    2
  • Meta
    5
  • Mistral AI
    5
  • Google
    4
  • NVIDIA
    3
  • Microsoft
    2
  • NVIDIA AI
    1
  • A100 PG509 200
    1
  • A100 SXM4 80GB
    1
  • A10G
    1
  • B200
    1
  • GB200
    1
  • Mistral AI

    ministral-14b-instruct-2512

    A general purpose VLM ideal for chat and instruction based use cases
    Model
    language generation
    2.77M
    5mo
    Items per page
    of 1 pages
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    Chat
    621K
    1y
    Mistral AI
    DeprecatedFree Endpoint

    mistral-medium-3-instruct

    Powerful, multimodal language model designed for enterprise applications, including software development, data analysis, and reasoning.
    Model
    language generation
    10mo
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    44.45M
    10mo
    Mistral AI
    Free Endpoint

    mistral-large-3-675b-instruct-2512

    A state-of-the-art general purpose MoE VLM ideal for chat, agentic and instruction based use cases.
    Model
    language generation
    3.35M
    5mo
    Mistral AI
    Free Endpoint

    mistral-nemotron

    Built for agentic workflows, this model excels in coding, instruction following, and function calling
    Model
    language generation
    4.58M
    11mo
    NVIDIA
    Free Endpoint

    nemotron-mini-4b-instruct

    Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling
    Model
    Chat
    1.64M
    1y
    Upstage
    Free Endpoint

    solar-10.7b-instruct

    Excels in NLP tasks, particularly in instruction-following, reasoning, and mathematics.
    Model
    Non-Commercial Use Only
    494K
    1y
    Meta
    Downloadable

    llama-3.2-1b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    chat
    42.03K343K
    1y
    Meta
    Downloadable

    llama-3.2-3b-instruct

    Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    24.89K1.15M
    1y
    Meta
    Downloadable

    llama-3.1-8b-instruct

    Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.
    Model
    Chat
    31.14M
    10mo
    Meta
    Downloadable

    llama-3.1-70b-instruct

    Powers complex conversations with superior contextual understanding, reasoning and text generation.
    Model
    Chat
    3.53M
    11mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    10.28K
    1y
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    3.17M
    10mo
    Meta
    Free Endpoint

    llama-4-maverick-17b-128e-instruct

    A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
    Model
    language generation
    21.96M
    10mo
    NVIDIA
    Downloadable

    nemotron-nano-12b-v2-vl

    Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
    Model
    language generation
    2.3M
    7mo
    Microsoft
    Downloadable

    phi-4-mini-instruct

    Lightweight multilingual LLM powering AI applications in latency bound, memory/compute constrained environments
    Model
    Chat
    448K
    1y
    Microsoft
    Free Endpoint

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Model
    Speech Recognition
    394K
    1y
    Abacus.AI
    Free Endpoint

    dracarys-llama-3.1-70b-instruct

    Fine-tuned Llama 3.1 70B model for code generation, summarization, and multi-language tasks.
    Model
    Code Generation
    592K
    1y
    Mistral AI
    DeprecatedDownloadable

    mistral-7b-instruct-v0.3

    This LLM follows instructions, completes requests, and generates creative text.
    Model
    B200
    30.57K
    11mo
    Qwen
    DeprecatedDownloadable

    qwen2.5-coder-32b-instruct

    Advanced LLM for code generation, reasoning, and fixing across popular programming languages.
    Model
    code completion
    1.03M
    11mo
    Retail
    LaunchableDeveloper Example

    Multi-Agent Intelligent Warehouse

    An AI-powered, multi-agent system designed to optimize warehouse operations through intelligent automation, real-time monitoring, and natural language interaction.
    Blueprint
    NVIDIA AI
    3mo