Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

9 results for

Filters

  • Free Endpoint
    5
  • Partner Endpoint
    4
  • Download Available
    7
  • Image Generation
    2
  • Text-to-Image
    2
  • Image-to-Text
    1
  • Optical Character Recognition
    1
  • Synthetic Data Generation
    1
  • Deepinfra
    3
  • GMI Cloud
    2
  • OpenRouter
    2
  • Together AI
    2
  • Bitdeer
    1
  • Qwen
    3
  • NVIDIA
    2
  • Google
    1
  • Microsoft
    1
  • Mistral AI
    1
  • B200
    1
  • GB200
    1
  • Qwen
    Downloadable

    qwen-image

    Qwen-Image is a text-to-image foundation model with advanced multilingual text rendering.
    Model
    Text-to-Image
    1mo
    Items per page
    of 1 pages
    Mistral AI
    DownloadableFree Endpoint

    mistral-small-4-119b-2603

    Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context
    Model
    code generation
    13M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    10M
    3mo
    NVIDIA
    Free Endpoint

    cosmos3-nano

    Generates physics-aware videos from text prompts or an image prompt for physical AI development.
    Model
    autonomous vehicles
    2K
    25d
    Microsoft
    Downloadable

    TRELLIS

    MSFT TRELLIS is a 3D AI model that generates high-quality 3D assets from text or image inputs.
    Model
    text-to-3d
    4K
    9mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    13M
    4mo
    Google
    Free Endpoint

    paligemma

    Vision language model adept at comprehending text and visual inputs to produce informative responses
    Model
    image
    10K
    1y
    Stability AI
    Downloadable

    stable-diffusion-3.5-large

    Stable Diffusion 3.5 is a popular text-to-image generation model
    Model
    Text-to-Image
    10mo
    NVIDIA
    Downloadable

    nemotron-ocr-v2

    Nemotron OCR v2 is a state-of-the-art multilingual text recognition model designed for robust end-to-end optical character recognition (OCR) on complex real-world images.
    Model
    Table Extraction
    151
    1d