[precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363

zhxchen17 · 2025-06-06T20:20:44Z

Stack from ghstack (oldest at bottom):

[precompile] Add CompilePackage to serialize dynamo states. #155118
-> [precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363
[precompile] Add low level C API to load precompiled dynamo code on functions. #155329

In a precompiled bytecode, it looks like the following:

pre-graph bytecode
...
compiled graph code
...
post-graph bytecode

In pre-graph bytecode we have calls into helper functions like torch._dynamo.utils.call_size which will invoke @disable inside the bytecode.

Normally torch.compile() will handle these frames fine, but for precompile we will load bytecode from a clean state of dynamo and we want a way to assert recompile never happen, so the current way to ensure this is by doing set_stance("fail_on_recompile") (open to any other idea to test this, but IMO this is the closest thing we have today).

This approach doesn't work when util functions like call_size() is involved and this PR fixes a bunch of places to make sure "fail_on_recompile" can skip through the functions meant to be skipped during compilation.

Differential Revision: D76156867

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames

@disable

…om precompile bytecode. In a precompiled bytecode, it looks like the following: ``` pre-graph bytecode ... compiled graph code ... post-graph bytecode ``` In pre-graph bytecode we have calls into helper functions like torch._dynamo.utils.call_size which will invoke @disable inside the bytecode. Normally torch.compile() will handle these frames fine, but for precompile we will load bytecode from a clean state of dynamo and we want a way to assert recompile never happen, so the current way to ensure this is by doing set_stance("fail_on_recompile") (open to any other idea to test this, but IMO this is the closest thing we have today). This approach doesn't work when util functions like call_size() is involved and this PR fixes a bunch of places to make sure "fail_on_recompile" can skip through the functions meant to be skipped during compilation. Differential Revision: [D76156867](https://our.internmc.facebook.com/intern/diff/D76156867/) [ghstack-poisoned]

pytorch-bot · 2025-06-06T20:20:48Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155363

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 2228142 with merge base be2ad70 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu) (gh) (similar failure)
MISSING REGRESSION TEST

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, linux.2xlarge, unstable) (gh)
exir/backend/test/test_to_backend_multi_method.py::TestToBackendMultiMethod::test_multi_method_end_to_end

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-06-06T20:20:54Z

This pull request was exported from Phabricator. Differential Revision: D76156867

zhxchen17 · 2025-06-06T20:30:44Z

@pytorchbot rebase -b main

pytorchmergebot · 2025-06-06T20:32:24Z

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

[ghstack-poisoned]

…m precompile bytecode. In a precompiled bytecode, it looks like the following: ``` pre-graph bytecode ... compiled graph code ... post-graph bytecode ``` In pre-graph bytecode we have calls into helper functions like torch._dynamo.utils.call_size which will invoke disable inside the bytecode. Normally torch.compile() will handle these frames fine, but for precompile we will load bytecode from a clean state of dynamo and we want a way to assert recompile never happen, so the current way to ensure this is by doing set_stance("fail_on_recompile") (open to any other idea to test this, but IMO this is the closest thing we have today). This approach doesn't work when util functions like call_size() is involved and this PR fixes a bunch of places to make sure "fail_on_recompile" can skip through the functions meant to be skipped during compilation. Differential Revision: [D76156867](https://our.internmc.facebook.com/intern/diff/D76156867/) ghstack-source-id: cb4a327 Pull Request resolved: #155363

pytorchmergebot · 2025-06-06T20:32:39Z

Successfully rebased gh/zhxchen17/24/orig onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/155363)

zhxchen17 · 2025-06-06T20:37:18Z

Also based on #155259

williamwen42

It took me a while to understand why you're removing @functools.wraps(fn) (so maybe a better comment could help), but my understanding is:

in precompile, we sometimes attempt to top-level compile a @disable'd function, but the functools.wraps frame ends up getting traced with the old callback still active, leading to failures under the fail_on_recompile stance.
We need the functools.wraps normally so that convert_frame.py has some knowledge of the original underlying frame (e.g. I believe not including this decorator will impact how trace_rules handles this function).

jansel

Test failures?

zhxchen17 · 2025-06-09T16:45:33Z

It took me a while to understand why you're removing @functools.wraps(fn) (so maybe a better comment could help), but my understanding is:

in precompile, we sometimes attempt to top-level compile a @disable'd function, but the functools.wraps frame ends up getting traced with the old callback still active, leading to failures under the fail_on_recompile stance.

We need the functools.wraps normally so that convert_frame.py has some knowledge of the original underlying frame (e.g. I believe not including this decorator will impact how trace_rules handles this function).

@williamwen42 yes your understand is correct to me. I can add more comments to explain this.

…ecompile from precompile bytecode." In a precompiled bytecode, it looks like the following: ``` pre-graph bytecode ... compiled graph code ... post-graph bytecode ``` In pre-graph bytecode we have calls into helper functions like torch._dynamo.utils.call_size which will invoke disable inside the bytecode. Normally torch.compile() will handle these frames fine, but for precompile we will load bytecode from a clean state of dynamo and we want a way to assert recompile never happen, so the current way to ensure this is by doing set_stance("fail_on_recompile") (open to any other idea to test this, but IMO this is the closest thing we have today). This approach doesn't work when util functions like call_size() is involved and this PR fixes a bunch of places to make sure "fail_on_recompile" can skip through the functions meant to be skipped during compilation. Differential Revision: [D76156867](https://our.internmc.facebook.com/intern/diff/D76156867/) cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames [ghstack-poisoned]

@disable

…om precompile bytecode. Pull Request resolved: #155363 In a precompiled bytecode, it looks like the following: ``` pre-graph bytecode ... compiled graph code ... post-graph bytecode ``` In pre-graph bytecode we have calls into helper functions like torch._dynamo.utils.call_size which will invoke @disable inside the bytecode. Normally torch.compile() will handle these frames fine, but for precompile we will load bytecode from a clean state of dynamo and we want a way to assert recompile never happen, so the current way to ensure this is by doing set_stance("fail_on_recompile") (open to any other idea to test this, but IMO this is the closest thing we have today). This approach doesn't work when util functions like call_size() is involved and this PR fixes a bunch of places to make sure "fail_on_recompile" can skip through the functions meant to be skipped during compilation. Differential Revision: [D76156867](https://our.internmc.facebook.com/intern/diff/D76156867/) ghstack-source-id: 289152993

facebook-github-bot · 2025-06-09T17:11:06Z

This pull request was exported from Phabricator. Differential Revision: D76156867

zhxchen17 · 2025-06-10T16:03:18Z

@pytorchbot merge

pytorchmergebot · 2025-06-10T16:07:37Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

facebook-github-bot · 2025-06-10T20:30:44Z

This pull request was exported from Phabricator. Differential Revision: D76156867

zhxchen17 mentioned this pull request Jun 6, 2025

[precompile] Add low level C API to load precompiled dynamo code on functions. #155329

Closed

pytorch-bot bot added ciflow/inductor module: dynamo labels Jun 6, 2025

facebook-github-bot added the fb-exported label Jun 6, 2025

zhxchen17 requested review from jansel, anijain2305 and jamesjwu June 6, 2025 20:20

zhxchen17 added the topic: not user facing topic category label Jun 6, 2025

Update

e7ad70b

[ghstack-poisoned]

anijain2305 requested a review from williamwen42 June 6, 2025 20:36

jamesjwu approved these changes Jun 6, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jun 6, 2025

williamwen42 reviewed Jun 6, 2025

View reviewed changes

jansel reviewed Jun 7, 2025

View reviewed changes

zhxchen17 requested review from williamwen42 and jansel June 9, 2025 17:12

zhxchen17 mentioned this pull request Jun 7, 2025

[precompile] Add CompilePackage to serialize dynamo states. #155118

Closed

jansel approved these changes Jun 10, 2025

View reviewed changes

pytorchmergebot added the merging label Jun 10, 2025

pytorchmergebot added the Merged label Jun 10, 2025

pytorchmergebot closed this in 38c4d05 Jun 10, 2025

pytorchmergebot removed the merging label Jun 10, 2025

This was referenced Jun 10, 2025

[precompile] Implement PrecompileContext for recording precompile artifacts, integrate with CompilePackage #154415

Closed

[Precompile] Hook up backend="inductor" #155387

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363

[precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363

Uh oh!

zhxchen17 commented Jun 6, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jun 6, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jun 6, 2025

Uh oh!

zhxchen17 commented Jun 6, 2025

Uh oh!

pytorchmergebot commented Jun 6, 2025

Uh oh!

pytorchmergebot commented Jun 6, 2025

Uh oh!

zhxchen17 commented Jun 6, 2025

Uh oh!

williamwen42 left a comment

Uh oh!

jansel left a comment

Uh oh!

zhxchen17 commented Jun 9, 2025

Uh oh!

facebook-github-bot commented Jun 9, 2025

Uh oh!

zhxchen17 commented Jun 10, 2025

Uh oh!

pytorchmergebot commented Jun 10, 2025

Uh oh!

facebook-github-bot commented Jun 10, 2025

Uh oh!

Uh oh!

[precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363

[precompile] Ensure @disable()-ed function won't trigger recompile from precompile bytecode. #155363

Uh oh!

Conversation

zhxchen17 commented Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155363

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

facebook-github-bot commented Jun 6, 2025

Uh oh!

zhxchen17 commented Jun 6, 2025

Uh oh!

pytorchmergebot commented Jun 6, 2025

Uh oh!

pytorchmergebot commented Jun 6, 2025

Uh oh!

zhxchen17 commented Jun 6, 2025

Uh oh!

williamwen42 left a comment

Choose a reason for hiding this comment

Uh oh!

jansel left a comment

Choose a reason for hiding this comment

Uh oh!

zhxchen17 commented Jun 9, 2025

Uh oh!

facebook-github-bot commented Jun 9, 2025

Uh oh!

zhxchen17 commented Jun 10, 2025

Uh oh!

pytorchmergebot commented Jun 10, 2025

Merge started

Uh oh!

facebook-github-bot commented Jun 10, 2025

Uh oh!

Uh oh!

zhxchen17 commented Jun 6, 2025 •

edited

Loading

pytorch-bot bot commented Jun 6, 2025 •

edited

Loading