Skip to content

Pull requests: NVIDIA-NeMo/Megatron-Bridge

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(ci): make run_ci_tests usable with current launch scripts area:ci ci CI, automation, test queue, or workflow infrastructure work needs-review PR is ready for code review and waiting on a reviewer
#4624 opened Jul 2, 2026 by yaoyu-33 Contributor Loading…
refactor(performance): remove legacy perf configs area:perf Performance optimizations and benchmarking breaking-change Public behavior or API compatibility changes feature New capabilities, enhancements, or enablement work full-test-suite needs-more-tests Requires additional L0 and L1 test coverage before merge waiting-on-customer Waiting on the original author to respond
#4623 opened Jul 2, 2026 by yaoyu-33 Contributor Loading…
refactor(recipes): add h100 recipe namespace area:recipe Training recipes and launch configs breaking-change Public behavior or API compatibility changes feature New capabilities, enhancements, or enablement work full-test-suite needs-review PR is ready for code review and waiting on a reviewer
#4622 opened Jul 2, 2026 by yaoyu-33 Contributor Loading…
Update Nemotron 3 Super B200 BF16 config area:perf Performance optimizations and benchmarking feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer
#4621 opened Jul 1, 2026 by zuriz-nv Loading…
refactor(scripts): share performance recipe runner area:perf Performance optimizations and benchmarking area:recipe Training recipes and launch configs area:training Training loop, callbacks, and runtime integration breaking-change Public behavior or API compatibility changes feature New capabilities, enhancements, or enablement work full-test-suite needs-more-tests Requires additional L0 and L1 test coverage before merge needs-review PR is ready for code review and waiting on a reviewer
#4620 opened Jul 1, 2026 by yaoyu-33 Contributor Loading…
ci: AUT-665 bump Dockerfile.ci base image to pytorch 26.06 area:build Dependencies, packaging, images, and environment setup build ci CI, automation, test queue, or workflow infrastructure work full-test-suite needs-review PR is ready for code review and waiting on a reviewer
#4617 opened Jul 1, 2026 by svcnemo-autobot Collaborator Loading…
feat(data): intra-microbatch reordering for MegatronMIMO (+ sequence packing, scalable DP) area:data Dataset builders, preprocessing, and samplers community-request feature New capabilities, enhancements, or enablement work full-test-suite waiting-on-customer Waiting on the original author to respond
#4608 opened Jul 1, 2026 by sailor1493 Loading…
4 tasks done
fix(training): reject unsupported local CUDA graph scopes area:perf Performance optimizations and benchmarking bug Something isn't working ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#4607 opened Jul 1, 2026 by yaoyu-33 Contributor Loading…
perf(qwen): enable full iteration cg for b200 b300 fp8 mx area:perf Performance optimizations and benchmarking feature New capabilities, enhancements, or enablement work ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#4606 opened Jun 30, 2026 by rhmukundan Contributor Loading…
fix(data): make finetuning batch sampler epoch-aware on checkpoint resume area:data Dataset builders, preprocessing, and samplers bug Something isn't working community-request ready-to-merge PR is approved, current, and only waiting for CI to pass before merge
#4601 opened Jun 30, 2026 by Achyuthan-S Loading…
5 tasks done
[build] chore: revert "bump transformer-engine to release_v2.16.post (#4536)" area:build Dependencies, packaging, images, and environment setup ci CI, automation, test queue, or workflow infrastructure work full-test-suite needs-more-tests Requires additional L0 and L1 test coverage before merge needs-review PR is ready for code review and waiting on a reviewer
#4600 opened Jun 30, 2026 by ko3n1g Contributor Loading…
Revert "[build] chore: bump transformer-engine to release_v2.16.post (#4536)" area:build Dependencies, packaging, images, and environment setup ci:build ci CI, automation, test queue, or workflow infrastructure work dependencies full-test-suite needs-more-tests Requires additional L0 and L1 test coverage before merge needs-review PR is ready for code review and waiting on a reviewer
#4599 opened Jun 30, 2026 by svcnemo-autobot Collaborator Loading…
Fix packed-sequence SFT prep edge cases for long-context THD+CP (#4593 §2.3-2.5) area:data Dataset builders, preprocessing, and samplers bug Something isn't working community-request needs-more-tests Requires additional L0 and L1 test coverage before merge waiting-on-customer Waiting on the original author to respond x-perplexity External request: Perplexity
#4598 opened Jun 30, 2026 by sen-ppl Loading…
Mbridge loaders
#4594 opened Jun 30, 2026 by maanug-nv Contributor Draft
5 tasks
Per-layer expert count + MoE aux-loss tracker sizing (the NCCL-deadlo… area:training Training loop, callbacks, and runtime integration bug Something isn't working community-request needs-more-tests Requires additional L0 and L1 test coverage before merge waiting-on-customer Waiting on the original author to respond
#4592 opened Jun 30, 2026 by chochowski Contributor Loading…
feat(quant): Add modelopt KV cache amax mapping. area:quant Quantization (PTQ, QAT, FP8 recipes) feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer
#4591 opened Jun 30, 2026 by mxinO Contributor Loading…
4 of 5 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.