Skip to main content
NVIDIA
Explore
Models
Skills
Blueprints
GPUs
Docs
⌘KCtrl+K
Terms of Use
Privacy Policy
Your Privacy Choices
Contact

Copyright © 2026 NVIDIA Corporation

7 results for

Filters

  • NVIDIA
    7
  • Developer
    7
  • AI Engineer
    6
  • Ml Engineer
    6
  • Hpc Developer
    5
  • DevOps Engineer
    1
  • AI And Machine Learning
    7
  • NeMo Megatron Bridge
    4
  • Megatron Core
    2
  • NeMoClaw
    1
  • Practical guidance for training MoE VLMs in Megatron Bridge. Compares FSDP and 3D-parallel approaches, using rounded lessons from Qwen3-VL, Qwen3-Next, and other multimodal experiments.
    Skill
    Developer
    91
    5d

    Run Megatron-LM (MLM) and Megatron Bridge training with mock or real data. Covers correlation testing, available recipes, and multi-GPU examples.
    Skill
    Developer
    91
    5d

    Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from DSV3, Qwen3, Qwen3-Next, and VLM bring-up work.
    Skill
    Developer
    92
    5d

    Validate and use packed sequences and long-context training in Megatron-Bridge, distinguishing offline packed SFT for LLMs from in-batch packing for VLMs, and applying the right CP constraints.
    Skill
    Developer
    91
    5d
    Items per page
    of 1 pages

    Linting and formatting for Megatron-LM. Covers running autoformat.sh, tools (ruff, black, isort, pylint, mypy), and code style rules.
    Skill
    Developer
    87
    3d

    How to launch distributed Megatron-LM training jobs on a SLURM cluster. Covers a minimal sbatch skeleton, environment-variable setup for torch.distributed.run, CUDA_DEVICE_MAX_CONNECTIONS rules across hardware and parallelism modes, container conventions,
    Skill
    Developer
    86
    3d

    Explains how to run NemoClaw on a remote GPU instance, including the deprecated Brev compatibility path and the preferred installer plus onboard flow. Use when deploying NemoClaw to a remote VM, onboarding a Brev instance, or migrating away from the legac
    Skill
    Developer
    162
    5d