NVIDIA
Explore
Models
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

20 results for

Filters (1)

  • API Endpoint
    14
  • Download Available
    6
  • Code Generation
    8
  • Image-to-Text
    4
  • Google
    8
  • Microsoft
    6
  • NVIDIA
    2
  • Meta
    2
  • AI21 Labs
    1
  • chat
  • NVIDIA

    llama-3.1-nemotron-nano-4b-v1.1

    State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
    Model
    edge
    101K
    8mo
    Microsoft

    phi-4-mini-flash-reasoning

    Lightweight reasoning model for applications in latency bound, memory/compute constrained environments
    Model
    edge
    491K
    7mo
    Google

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    531K
    9mo
    Mistral AI

    magistral-small-2506

    High performance reasoning model optimized for efficiency and edge deployment
    Model
    coding
    4.16M
    8mo
    Google

    gemma-2-27b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    678K
    9mo
    Google

    gemma-2-9b-it

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    4.47M
    9mo
    Google

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing, resource constraint applications
    Model
    Translation
    4.59K531K
    9mo
    Google

    gemma-3-27b-it

    Cutting-edge open multimodal model exceling in high-quality reasoning from images.
    Model
    Vision Assistant
    5.69M
    9mo
    Google

    gemma-7b

    Cutting-edge text generation model text understanding, transformation, and code generation.
    Model
    chat
    532K
    9mo
    AI21 Labs

    jamba-1.5-mini-instruct

    Cutting-edge MOE based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    536K
    9mo
    NVIDIA

    llama-3.1-nemotron-nano-8b-v1

    Leading reasoning and agentic AI accuracy model for PC and edge.
    Model
    chat
    631K
    8mo
    Meta

    llama-3.2-11b-vision-instruct

    Cutting-edge vision-language model exceling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    750K
    9mo
    Meta

    llama-3.2-90b-vision-instruct

    Cutting-edge vision-Language model exceling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    607K
    9mo
    Microsoft

    phi-3-medium-128k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    515K
    9mo
    Microsoft

    phi-3-medium-4k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    507K
    9mo
    Microsoft

    phi-3-small-128k-instruct

    Long context cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    613K
    9mo
    Microsoft

    phi-3-small-8k-instruct

    Cutting-edge lightweight open language model exceling in high-quality reasoning.
    Model
    chat
    510K
    9mo
    Microsoft

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model exceling in high-quality reasoning from image and audio inputs.
    Model
    Speech Recognition
    490K
    9mo
    Google

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    658K
    7mo
    Google

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    718K
    7mo
    Items per page
    of 1 pages