NVIDIA

Copyright © 2026 NVIDIA Corporation

23 results for

Filters

  • API Endpoint
    15
  • Download Available
    8
  • Code Generation
    8
  • Image-to-Text
    5
  • Google
    8
  • Microsoft
    7
  • NVIDIA
    4
  • Meta
    2
  • AI21 Labs
    1
    NVIDIA

    llama-3.1-nemotron-nano-4b-v1.1

    State-of-the-art open model for reasoning, code, math, and tool calling - suitable for edge agents
    Model
    edge
    98.31K
    8mo
    Microsoft

    phi-4-mini-flash-reasoning

    Lightweight reasoning model for applications in latency-bound, memory- and compute-constrained environments
    Model
    edge
    468K
    7mo
    Google

    gemma-2-2b-it

    Advanced small language generative AI model for edge applications
    Model
    chat
    508K
    9mo
    Mistral AI

    magistral-small-2506

    High performance reasoning model optimized for efficiency and edge deployment
    Model
    coding
    4.05M
    8mo
    Google

    gemma-2-27b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    652K
    9mo
    Google

    gemma-2-9b-it

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    4.35M
    9mo
    Google

    gemma-3-1b-it

    A lightweight, multilingual, advanced SLM text model for edge computing and resource-constrained applications
    Model
    Translation
    4.57K 512K
    9mo
    Google

    gemma-3-27b-it

    Cutting-edge open multimodal model excelling in high-quality reasoning from images.
    Model
    Vision Assistant
    5.6M
    9mo
    Google

    gemma-7b

    Cutting-edge text generation model for text understanding, transformation, and code generation.
    Model
    chat
    502K
    9mo
    AI21 Labs

    jamba-1.5-mini-instruct

    Cutting-edge MoE-based LLM designed to excel in a wide array of generative AI tasks.
    Model
    chat
    510K
    9mo
    NVIDIA

    llama-3.1-nemotron-nano-8b-v1

    Leading reasoning and agentic AI accuracy model for PC and edge.
    Model
    chat
    606K
    8mo
    Meta

    llama-3.2-11b-vision-instruct

    Cutting-edge vision-language model excelling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    711K
    9mo
    Meta

    llama-3.2-90b-vision-instruct

    Cutting-edge vision-language model excelling in high-quality reasoning from images.
    Model
    Image-Text Retrieval
    582K
    9mo
    NVIDIA

    nemoretriever-parse

    Cutting-edge vision-language model excelling in retrieving text and metadata from images.
    Model
    optical character recognition
    295K
    9mo
    NVIDIA

    nemotron-parse

    Cutting-edge vision-language model excelling in retrieving text and metadata from images.
    Model
    text and table extraction
    432K
    4mo
    Microsoft

    phi-3-medium-128k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    chat
    492K
    9mo
    Microsoft

    phi-3-medium-4k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    chat
    484K
    9mo
    Microsoft

    phi-3-small-128k-instruct

    Long-context, cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    chat
    590K
    9mo
    Microsoft

    phi-3-small-8k-instruct

    Cutting-edge lightweight open language model excelling in high-quality reasoning.
    Model
    chat
    487K
    9mo
    Microsoft

    phi-3.5-vision-instruct

    Cutting-edge open multimodal model excelling in high-quality reasoning from images.
    Model
    Vision Assistant
    535K
    1y
    Microsoft

    phi-4-multimodal-instruct

    Cutting-edge open multimodal model excelling in high-quality reasoning from image and audio inputs.
    Model
    Speech Recognition
    462K
    9mo
    Google

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    632K
    7mo
    Google

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    695K
    7mo