NVIDIA

Copyright © 2026 NVIDIA Corporation

6 results for

  • Google
    Downloadable, Free Endpoint

    diffusiongemma-26b-a4b-it

    Diffusion-based 26B-parameter LLM enabling parallel token generation for real-time text apps
    Model
    diffusion-llm
    97.31K
    5d
    Google
    Free Endpoint

    gemma-2-2b-it

    Advanced small generative language model for edge applications
    Model
    Chat
    1.42M
    1y
    Google
    Free Endpoint

    gemma-3n-e2b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    33.75M
    11mo
    Google
    Free Endpoint

    gemma-3n-e4b-it

    An edge computing AI model which accepts text, audio and image input, ideal for resource-constrained environments
    Model
    language generation
    3.79M
    11mo
    Google
    Downloadable, Free Endpoint

    gemma-4-31b-it

    Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.
    Model
    reasoning
    5.49M
    2mo
    OpenAI
    Downloadable, Free Endpoint

    gpt-oss-120b

    Mixture of Experts (MoE) reasoning LLM (text-only) designed to fit within 80GB GPU.
    Model
    reasoning
    57.12M
    10mo