foreach_map_fn: add UX fallback for torch.mm to avoid Dynamo graph breaks #159757
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/159757
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit d1f887c with merge base fb887c3. This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "topic: not user facing"
Hey, just confirming this PR is ready for review. It's fully updated, tests are passing (with unrelated flakes), and I'm done making changes. Let me know if anything else is needed!
This is cool! Could you add a test? Also, does this work without the special handling? Your addition is very similar to the code below it, and perhaps this was already working. Either way, adding a test for this scenario would be great, and then I'll approve!
Hi @mlazos, just wanted to share a quick update. I've started working on the fix for handling matmuls within foreach_map. Currently validating the fallback logic and ensuring it integrates cleanly without affecting existing behavior. I'll update the PR once things are stable. Please let me know if there are any specific edge cases or requirements I should keep in mind. Thanks!
Added the test! Let me know if anything else is needed.
Does this change actually improve perf? foreach_map should be a perf optimization, so I'd expect to see benchmarks.
expected = [torch.mm(a, b) for a, b in zip(a_list, b_list)]

for r, e in zip(result, expected):
    self.assertTrue(torch.allclose(r, e), msg=f"Expected {e}, got {r}")
Suggested change:
-    self.assertTrue(torch.allclose(r, e), msg=f"Expected {e}, got {r}")
+    self.assertEqual(r, e)
if op is torch.mm:
    if len(new_args) != 2:
        raise ValueError("torch.mm requires exactly two argument lists")
    return [torch.mm(a, b) for a, b in zip(new_args[0], new_args[1])]
Wait... does this just for-loop over torch.mm with no optimization?
Yeah, this is just a UX improvement so that we no longer need to work around the matmul (we can basically pass a whole single-tensor optimizer implementation, vs. breaking it up into multiple foreach_map calls to work around the matmul).
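For illustration, a rough sketch of the kind of single-tensor step being referred to; the update rule, the names, and the shapes below are invented for this example and are not taken from this PR:

```python
import torch

def single_tensor_step(param, grad, precond):
    # A GEMM mixed in with pointwise ops. Without the fallback, the mm forces
    # the step to be split around it (foreach_map for the pointwise parts,
    # a plain Python loop for the matmuls).
    update = torch.mm(precond, grad)
    return param - 0.1 * update

# Dummy tensors just to show the step runs as ordinary eager code.
param = torch.randn(4, 2)
grad = torch.randn(4, 2)
precond = torch.randn(4, 4)
new_param = single_tensor_step(param, grad, precond)
```

With the special handling, a step like this could be passed to foreach_map over lists of params, grads, and preconditioners as one unit instead of being hand-split.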
torch/_foreach_where.py (Outdated)
looks irrelevant to this change, let's address foreach_where separately.
torch/optim/swa_utils.py (Outdated)
not related
Hi @janeyx99, thanks for the detailed review! I've removed the unrelated changes (torch/_foreach_where.py and torch/optim/swa_utils.py). Let me know if you'd like me to add benchmarks for this path. Based on your earlier comment, I understand performance is a key concern here; happy to provide numbers or additional tests if needed. Thanks again for your time!
It does not appear that you've removed irrelevant changes. And yes, please run some benchmarks comparing against just a for loop of mms.
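For reference, a minimal benchmark sketch along the lines requested; the shapes, list length, and the use of torch.utils.benchmark are assumptions, and the foreach_map-based variant from this PR would be timed the same way as the compiled loop below (its call is omitted since the exact entry point isn't shown in this thread):

```python
import torch
import torch.utils.benchmark as benchmark

device = "cuda" if torch.cuda.is_available() else "cpu"
a_list = [torch.randn(128, 128, device=device) for _ in range(32)]
b_list = [torch.randn(128, 128, device=device) for _ in range(32)]

def loop_of_mms(xs, ys):
    # Baseline: a plain Python for loop of torch.mm calls.
    return [torch.mm(x, y) for x, y in zip(xs, ys)]

compiled_loop = torch.compile(loop_of_mms)
compiled_loop(a_list, b_list)  # warm-up call to trigger compilation

for label, fn in [("eager loop", loop_of_mms), ("compiled loop", compiled_loop)]:
    t = benchmark.Timer(
        stmt="fn(a_list, b_list)",
        globals={"fn": fn, "a_list": a_list, "b_list": b_list},
    ).blocked_autorange()
    print(label, t.median)
```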
Compare: 148a24e to d1f887c
Thanks for your patience. I've cleaned up the branch so it only includes the changes relevant to this PR.
I'm still not sure this helps with any performance, but I'll let @mlazos finish the review as he's the expert with foreach_map!
Yeah, this won't necessarily improve perf over generic torch.compile; this is a UX improvement so we can have cleaner code with a whole optimizer loop with GEMMs in it. In the future we can change the lowering to use grouped GEMM or write a foreach_mm kernel if applicable.
@@ -318,6 +318,12 @@ def foreach_map_fn(*args):
    if not at_least_one_list:
        return op(*args[1:])

    # Special handling for torch.mm
So before this PR it was an open question whether this was needed or not. Can you check what happens in your test without the special handling? It's possible this was already working and just needed a test.
This PR adds special handling to foreach_map_fn to support torch.mm (matrix multiplication) with list arguments. If torch.mm is passed, it now unpacks two argument lists and applies mm elementwise.
This prevents graph breaks in Dynamo when torch.mm is used in a foreach_map context.
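As a standalone sketch of the described semantics (mm_fallback and the example shapes are illustrative, not the PR's code):

```python
import torch

def mm_fallback(a_list, b_list):
    # Unpack the two argument lists and apply torch.mm elementwise,
    # returning one output tensor per input pair.
    if len(a_list) != len(b_list):
        raise ValueError("torch.mm requires two argument lists of equal length")
    return [torch.mm(a, b) for a, b in zip(a_list, b_list)]

a_list = [torch.randn(4, 3) for _ in range(2)]
b_list = [torch.randn(3, 5) for _ in range(2)]
out = mm_fallback(a_list, b_list)
assert all(o.shape == (4, 5) for o in out)
```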
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela