Skip to content

[Dev] Numerical fix for moe single grouped weight with fp8 fp4 primary weight and grad norm spikes#5464

Open
zhongbozhu wants to merge 19 commits into
NVIDIA:devfrom
zhongbozhu:dev_fix_single_weight
Open

[Dev] Numerical fix for moe single grouped weight with fp8 fp4 primary weight and grad norm spikes#5464
zhongbozhu wants to merge 19 commits into
NVIDIA:devfrom
zhongbozhu:dev_fix_single_weight

Commits

Commits on Jun 26, 2026

Commits on Jun 27, 2026

Commits on Jun 28, 2026

Commits on Jun 29, 2026

Commits on Jun 30, 2026