[DTensor] Assert DTensorSpec has valid placements #158133

Open
wants to merge 3 commits into
base: gh/wconstab/424/base

Conversation

wconstab
Contributor

@wconstab wconstab commented Jul 11, 2025

Stack from ghstack (oldest at bottom):

This helped identify buggy sharding rules during debugging, why not
check it in.

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @d4l3k @pragupta
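
For readers unfamiliar with the kind of check the title describes, here is a minimal sketch of validating placements when a spec is constructed. It does not reproduce this PR's diff: `ToyMesh`, `ToySpec`, and the stand-in `Shard` and `Replicate` classes are hypothetical, and the invariant shown (one placement per mesh dimension, each of a known type) is an assumption about what "valid placements" means here.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class Replicate:
    """Hypothetical stand-in: tensor is replicated along a mesh dimension."""


@dataclass(frozen=True)
class Shard:
    """Hypothetical stand-in: tensor dim `dim` is split along a mesh dimension."""

    dim: int


Placement = Union[Replicate, Shard]


@dataclass(frozen=True)
class ToyMesh:
    """Hypothetical stand-in for a device mesh; only its shape matters here."""

    shape: Tuple[int, ...]

    @property
    def ndim(self) -> int:
        return len(self.shape)


@dataclass(frozen=True)
class ToySpec:
    """Hypothetical spec that validates its placements at construction time."""

    mesh: ToyMesh
    placements: Tuple[Placement, ...]

    def __post_init__(self) -> None:
        # Assumed invariant: exactly one placement per mesh dimension.
        if len(self.placements) != self.mesh.ndim:
            raise ValueError(
                f"expected {self.mesh.ndim} placements, got {len(self.placements)}"
            )
        # Assumed invariant: every entry is a known placement type.
        for p in self.placements:
            if not isinstance(p, (Replicate, Shard)):
                raise TypeError(f"invalid placement: {p!r}")


# A well-formed spec on a 2-D mesh constructs without error.
good = ToySpec(ToyMesh((2, 4)), (Shard(0), Replicate()))
```

Failing at construction time means a sharding rule that emits a malformed spec is caught where the spec is created, not later when the spec is consumed.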


pytorch-bot bot commented Jul 11, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158133

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

❌ 1 New Failure, 1 Unrelated Failure

As of commit 7ed15e9 with merge base b4476ca (image):

NEW FAILURE - The following job has failed:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot bot added the ciflow/inductor and oncall: distributed labels Jul 11, 2025
wconstab added a commit that referenced this pull request Jul 11, 2025
This helped identify buggy sharding rules during debugging, why not
check it in.

ghstack-source-id: 8d89ee5
Pull Request resolved: #158133
wconstab added a commit that referenced this pull request Jul 11, 2025
This helped identify buggy sharding rules during debugging, why not
check it in.

ghstack-source-id: 5c69057
Pull Request resolved: #158133
@wconstab wconstab added the topic: not user facing topic category label Jul 11, 2025
@wconstab
Contributor Author

wconstab commented Jul 11, 2025

hmm, this looks (?) like a real bug in some dtensor code, better go fix it

edit: this was a trivial bug in the unit test itself, fixed

Contributor

@XilunWu XilunWu left a comment

LGTM. Wonder if we have enforced this equation everywhere in DTensor.

wconstab added a commit that referenced this pull request Jul 11, 2025
This helped identify buggy sharding rules during debugging, why not
check it in.

ghstack-source-id: 3fde8ba
Pull Request resolved: #158133
@wconstab
Contributor Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 16, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here

@ZainRizvi
Contributor

@pytorchbot revert -c ghfirst -m "Sorry but this is breaking internally. See D78496151 for details. To validate your fixes internally, you can follow the instructions here: https://fburl.com/fixing-ghfirst-reverts"

@pytorchmergebot
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

@pytorchmergebot
Collaborator

@wconstab your PR has been successfully reverted.

pytorchmergebot added a commit that referenced this pull request Jul 21, 2025
This reverts commit 1839e8d.

Reverted #158133 on behalf of https://github.com/ZainRizvi due to Sorry but this is breaking internally. See D78496151 for details. To validate your fixes internally, you can follow the instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](#158133 (comment)))
pytorchmergebot added the Reverted and ci-no-td labels Jul 21, 2025
facebook-github-bot pushed a commit that referenced this pull request Jul 24, 2025
Summary:
This helped identify buggy sharding rules during debugging, why not
check it in.

Approved by: https://github.com/XilunWu, https://github.com/zpcore
ghstack dependencies: #158132

Test Plan:
contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/1839e8d04b81ee6eda0cff6fbfc218a7a600f6f7

Rollback Plan:

Differential Revision: D78929245
facebook-github-bot pushed a commit that referenced this pull request Jul 24, 2025
Summary:
This helped identify buggy sharding rules during debugging, why not
check it in.

Test Plan:
contbuild & OSS CI

Rollback Plan:

Differential Revision: D78929245
@iremyux
Collaborator

iremyux commented Aug 6, 2025

Adding the ciflow/win-arm64 label to trigger Windows Arm64 CI for testing purposes; this has nothing to do with this PR specifically. (It should not affect the acceptance of the PR even if it fails.)

@iremyux iremyux added the ciflow/win-arm64 Trigger Windows Arm64 CI Workflows label Aug 6, 2025
Labels
ci-no-td (Do not run TD on this PR)
ciflow/inductor
ciflow/trunk (Trigger trunk jobs on your pull request)
ciflow/win-arm64 (Trigger Windows Arm64 CI Workflows)
Merged
oncall: distributed (Add this issue/PR to distributed oncall triage queue)
Reverted
topic: not user facing (topic category)
6 participants