[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

njriasan · 2025-08-04T17:38:10Z

Summary: Inductor's 3.4 Triton release is the most common used variant of Triton, but if someone is working with an alternative version of Triton this may not match. This moves the version check from 3.4 Triton to any variant that has support for the TMA APIs.

Test Plan:
Relying on CI. Should be a NFC.

Rollback Plan:

Reviewed By: davidberard98

Differential Revision: D79378792

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

pytorch-bot · 2025-08-04T17:38:14Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/159777

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

ghstack-mergeability-check and Check labels failing with 'Resource not accessible by integration'

❌ 1 New Failure, 1 Unrelated Failure

As of commit 4d92666 with merge base a7f3bdf ():

NEW FAILURE - The following job has failed:

rocm-mi300 / linux-noble-rocm-py3.12-mi300 / test (default, 3, 6, linux.rocm.gpu.gfx942.2) (gh)
inductor/test_torchinductor_strided_blocks.py::TritonTensorDescriptorTestCUDA::test_2d_reduction_no_x_dim_cuda

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / linux-jammy-py3_9-clang9-xla / test (xla, 1, 1, lf.linux.12xlarge, unstable) (gh) (#158876)
/var/lib/jenkins/workspace/xla/torch_xla/csrc/runtime/BUILD:476:14: Compiling torch_xla/csrc/runtime/xla_util_test.cpp failed: (Exit 1): gcc failed: error executing CppCompile command (from target //torch_xla/csrc/runtime:xla_util_test) /usr/bin/gcc -U_FORTIFY_SOURCE -fstack-protector -Wall -Wunused-but-set-parameter -Wno-free-nonheap-object -fno-omit-frame-pointer -g0 -O2 '-D_FORTIFY_SOURCE=1' -DNDEBUG -ffunction-sections ... (remaining 229 arguments skipped)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-08-04T17:38:37Z

This pull request was exported from Phabricator. Differential Revision: D79378792

facebook-github-bot · 2025-08-04T17:50:39Z

This pull request was exported from Phabricator. Differential Revision: D79378792

Summary: Pull Request resolved: pytorch#159777 Inductor's 3.4 Triton release is the most common used variant of Triton, but if someone is working with an alternative version of Triton this may not match. This moves the version check from 3.4 Triton to any variant that has support for the TMA APIs. Test Plan: Relying on CI. Should be a NFC. Rollback Plan: Reviewed By: davidberard98 Differential Revision: D79378792

davidberard98

it needs to be linted, otherwise lgtm & thanks for the fix!

Summary: Inductor's 3.4 Triton release is the most common used variant of Triton, but if someone is working with an alternative version of Triton this may not match. This moves the version check from 3.4 Triton to any variant that has support for the TMA APIs. Test Plan: Relying on CI. Should be a NFC. Rollback Plan: Reviewed By: davidberard98 Differential Revision: D79378792

Summary: Pull Request resolved: pytorch#159777 Inductor's 3.4 Triton release is the most common used variant of Triton, but if someone is working with an alternative version of Triton this may not match. This moves the version check from 3.4 Triton to any variant that has support for the TMA APIs. Test Plan: Relying on CI. Should be a NFC. Rollback Plan: Reviewed By: davidberard98 Differential Revision: D79378792

facebook-github-bot · 2025-08-04T22:21:19Z

This pull request was exported from Phabricator. Differential Revision: D79378792

facebook-github-bot · 2025-08-05T03:21:36Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2025-08-05T03:23:36Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

izaitsevfb · 2025-08-05T21:58:28Z

@pytorchbot revert -m "breaking inductor test on ROCm" -c nosignal

FAILED [2.6870s] inductor/test_torchinductor_strided_blocks.py::TritonTensorDescriptorTestCUDA::test_welford_non_block_pointer_cuda - AssertionError: Scalars are not equal!
caused by this PR:
https://github.com/pytorch/pytorch/pull/159777
[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

hud link:
https://hud.pytorch.org/hud/pytorch/pytorch/main/1?per_page=50&name_filter=rocm%20%2F&mergeEphemeralLF=true

pytorchmergebot · 2025-08-05T22:00:14Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2025-08-05T22:00:27Z

@njriasan your PR has been successfully reverted.

)" This reverts commit bbc0df1. Reverted #159777 on behalf of https://github.com/izaitsevfb due to breaking inductor test on ROCm ([comment](#159777 (comment)))

facebook-github-bot · 2025-08-05T22:53:26Z

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

pytorchmergebot · 2025-08-05T22:55:25Z

Merge started

Your change will be merged while ignoring the following 1 checks: rocm-mi300 / linux-noble-rocm-py3.12-mi300 / test (default, 3, 6, linux.rocm.gpu.gfx942.2)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-08-05T22:55:43Z

Merge failed

Reason: 1 jobs have failed, first few of them are: Meta Internal-Only Changes Check

Details for Dev Infra team

Raised by workflow job

pytorch-bot bot added ciflow/inductor module: inductor labels Aug 4, 2025

facebook-github-bot added the fb-exported label Aug 4, 2025

njriasan force-pushed the export-D79378792 branch from 011cfd4 to f62fea1 Compare August 4, 2025 17:50

njriasan added the topic: not user facing topic category label Aug 4, 2025

njriasan requested a review from davidberard98 August 4, 2025 17:53

davidberard98 approved these changes Aug 4, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 4, 2025

njriasan force-pushed the export-D79378792 branch from f62fea1 to 835199e Compare August 4, 2025 22:16

njriasan force-pushed the export-D79378792 branch from 835199e to 4d92666 Compare August 4, 2025 22:21

pytorchmergebot added the merging label Aug 5, 2025

pytorchmergebot closed this in bbc0df1 Aug 5, 2025

pytorchmergebot added Merged and removed merging labels Aug 5, 2025

jithunnair-amd added the ciflow/rocm-mi300 Trigger "default" config CI on ROCm MI300 label Aug 5, 2025

pytorchmergebot added the Reverted label Aug 5, 2025

pytorchmergebot added the ci-no-td Do not run TD on this PR label Aug 5, 2025

pytorchmergebot reopened this Aug 5, 2025

pytorchmergebot added the merging label Aug 5, 2025

pytorchmergebot removed the merging label Aug 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

Uh oh!

njriasan commented Aug 4, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Aug 4, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

davidberard98 left a comment

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

facebook-github-bot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

izaitsevfb commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

facebook-github-bot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

Uh oh!

[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

Are you sure you want to change the base?

[Inductor][Triton] Support TMA before strict 3.4 cutoff #159777

Uh oh!

Conversation

njriasan commented Aug 4, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/159777

❗ 1 Active SEVs

❌ 1 New Failure, 1 Unrelated Failure

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

davidberard98 left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Aug 4, 2025

Uh oh!

facebook-github-bot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Merge started

Uh oh!

izaitsevfb commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Uh oh!

facebook-github-bot commented Aug 5, 2025

Uh oh!

pytorchmergebot commented Aug 5, 2025

Merge started

Uh oh!

pytorchmergebot commented Aug 5, 2025

Merge failed

Uh oh!

Uh oh!

njriasan commented Aug 4, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Aug 4, 2025 •

edited

Loading