Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

12 results for

Filters

  • Free Endpoint
    3
  • Partner Endpoint
    5
  • Download Available
    5
  • Image Generation
    2
  • Text-to-Image
    2
  • Deepinfra
    3
  • GMI Cloud
    3
  • Bitdeer
    2
  • Together AI
    2
  • Eigen AI
    1
  • NVIDIA
    7
  • Qwen
    5
  • AI Engineer
    3
  • Developer
    3
  • Hpc Developer
    3
  • Ml Engineer
    3
  • AI And Machine Learning
    3
  • B200
    2
  • GB200
    1
  • H100 80GB HBM3
    1
  • H200
    1
  • NeMo Megatron Bridge
    3
  • Qwen
    Downloadable

    qwen-image

    Qwen-Image is a text-to-image foundation model with advanced multilingual text rendering.
    Model
    Text-to-Image
    1mo
    Items per page
    of 1 pages
    Qwen
    Downloadable

    qwen-image-edit

    Qwen-Image-Edit is an image editing model with multilingual text editing and strong subject consistency.
    Model
    Text-to-Image
    1mo
    Qwen
    DownloadableFree Endpoint

    qwen3-next-80b-a3b-instruct

    Qwen3-Next Instruct blends hybrid attention, sparse MoE, and stability boosts for ultra-long context AI.
    Model
    text-generation
    26.97M
    8mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-397b-a17b

    Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
    Model
    MoE
    13.15M
    3mo
    Qwen
    DownloadableFree Endpoint

    qwen3.5-122b-a10b

    122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.
    Model
    tool calling
    10.33M
    3mo
    DGX Spark
    20 MINS

    CLI Coding Agent

    Build local CLI coding agents with Ollama
    Playbook
    Coding
    1mo
    DGX Spark
    60 MIN

    cuTile Kernels

    Run cuTile kernel benchmarks, FMHA implementation, and LLM inference on DGX Spark and B300
    Playbook
    FMHA
    1mo
    DGX Station
    30 MIN

    LLM Inference with SGLang

    Serve LLMs with SGLang on DGX Station (Qwen3-8B default; Qwen3.6 MoE optional)—prefix-cached multi-turn, structured output, benchmarks, and inference-server guidance
    Playbook
    RadixAttention
    17d

    Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from DSV3, Qwen3, Qwen3-Next, and VLM bring-up work.
    Skill
    Developer
    350
    14d

    Long-context MoE training guidance for Megatron Bridge. Covers CP sizing, selective recompute, dispatcher choices, and practical patterns from DSV3, Qwen3, and Qwen3-Next long-context experiments.
    Skill
    Developer
    352
    14d

    Practical guidance for training MoE VLMs in Megatron Bridge. Compares FSDP and 3D-parallel approaches, using rounded lessons from Qwen3-VL, Qwen3-Next, and other multimodal experiments.
    Skill
    Developer
    348
    14d
    DGX Spark
    1 HR

    Vision-Language Model Fine-tuning

    Fine-tune Vision-Language Models for image and video understanding tasks using Qwen2.5-VL and InternVL3
    Playbook
    DGX
    8mo