Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

3 results for

Filters

  • NVIDIA
    3
  • AI Engineer
    3
  • Developer
    3
  • Hpc Developer
    3
  • Ml Engineer
    3
  • AI And Machine Learning
    3
  • NeMo Megatron Bridge
    3
  • Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from DSV3, Qwen3, Qwen3-Next, and VLM bring-up work.
    Skill
    Developer
    350
    14d

    Long-context MoE training guidance for Megatron Bridge. Covers CP sizing, selective recompute, dispatcher choices, and practical patterns from DSV3, Qwen3, and Qwen3-Next long-context experiments.
    Skill
    Developer
    352
    14d

    Practical guidance for training MoE VLMs in Megatron Bridge. Compares FSDP and 3D-parallel approaches, using rounded lessons from Qwen3-VL, Qwen3-Next, and other multimodal experiments.
    Skill
    Developer
    348
    14d
    Items per page
    of 1 pages