Skip to content

ci: AUT-665 bump Dockerfile.ci base image to pytorch 26.06#4617

Open
svcnemo-autobot wants to merge 6 commits into
NVIDIA-NeMo:mainfrom
svcnemo-autobot:ci/implement-588a1e0ebfef
Open

ci: AUT-665 bump Dockerfile.ci base image to pytorch 26.06#4617
svcnemo-autobot wants to merge 6 commits into
NVIDIA-NeMo:mainfrom
svcnemo-autobot:ci/implement-588a1e0ebfef

Conversation

@svcnemo-autobot

Copy link
Copy Markdown
Collaborator

Bumped the CI base image in docker/Dockerfile.ci from pytorch:26.04-py3 to pytorch:26.06-py3.

@svcnemo-autobot svcnemo-autobot requested a review from a team as a code owner July 1, 2026 19:22
@copy-pr-bot

copy-pr-bot Bot commented Jul 1, 2026

Copy link
Copy Markdown

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@svcnemo-autobot svcnemo-autobot added build ci CI, automation, test queue, or workflow infrastructure work labels Jul 1, 2026
@svcnemo-autobot

Copy link
Copy Markdown
Collaborator Author

/ok to test fe806d3

@ko3n1g

ko3n1g commented Jul 1, 2026

Copy link
Copy Markdown
Contributor

@svcnemo-autobot check for all other references of this base image and bump them to 26.06 too

@ko3n1g

ko3n1g commented Jul 1, 2026

Copy link
Copy Markdown
Contributor

@svcnemo-autobot check for all other references of nvcr.io/nvidia/pytorch and bump them to 26.06 too

@svcnemo-autobot

Copy link
Copy Markdown
Collaborator Author

🤖 Applied your feedback — pushed 7cebf29. Re-running CI.

@svcnemo-autobot

Copy link
Copy Markdown
Collaborator Author

/ok to test 7cebf29

@svcnemo-autobot svcnemo-autobot added the docs Documentation-only updates or documentation debt label Jul 1, 2026
@svcnemo-autobot

Copy link
Copy Markdown
Collaborator Author

🤖 No code change needed — The only failing check is the L0_Launch_training_local_checkpointing GPU functional test, which timed out after 35 minutes during NCCL init with mlx5 devx "No space left on device" hardware warnings — an infrastructure flake unrelated to the pytorch base-image version bump; the container built and ran successfully, so no code change is warranted.

@yaoyu-33 yaoyu-33 added the area:build Dependencies, packaging, images, and environment setup label Jul 1, 2026
@yaoyu-33 yaoyu-33 added full-test-suite needs-review PR is ready for code review and waiting on a reviewer and removed docs Documentation-only updates or documentation debt labels Jul 1, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:build Dependencies, packaging, images, and environment setup build ci CI, automation, test queue, or workflow infrastructure work full-test-suite needs-review PR is ready for code review and waiting on a reviewer

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants