GH-132554: Fix tier2 `FOR_ITER` implementation and optimizations #135137

markshannon · 2025-06-04T14:09:41Z

When adding virtual iterators, the tier 1 and tier 2 implementations of FOR_ITER diverged. I've already fixed a problem where the instrumented FOR_ITER differed from the normal one.

To prevent these problems happening again, this PR factors out the majority of FOR_ITER into a helper function for the 3 versions of FOR_ITER to share.

I've also added PyStackRef_ERROR to distinguish between errors and no result and remove the need for an additional out parameter for the helper function.

Also fixes a bug in the code generator where there are three or more output values, one is an unchanged input, one is a changed input and one is undefined.

Issue: Use tagged ints for faster iteration #132554

Fidget-Spinner · 2025-06-04T15:45:15Z

Can you check out the Windows JIT failures please? I'll review the PR after that.

brandtbucher · 2025-06-05T04:23:59Z

Include/internal/pycore_ceval.h

@@ -353,7 +353,8 @@ PyAPI_FUNC(_PyStackRef) _PyFloat_FromDouble_ConsumeInputs(_PyStackRef left, _PyS
 extern int _PyRunRemoteDebugger(PyThreadState *tstate);
 #endif

-_PyStackRef _PyForIter_NextWithIndex(PyObject *seq, _PyStackRef index);
+_PyStackRef


Suggested change

_PyStackRef

PyAPI_FUNC(_PyStackRef)

brandtbucher · 2025-06-05T04:27:59Z

Python/optimizer_bytecodes.c

+    op(_GET_ITER, (iterable -- iter, index_or_null)) {
+        if (sym_matches_type(iterable, &PyTuple_Type) || sym_matches_type(iterable, &PyList_Type)) {
+            iter = iterable;
+            index_or_null = sym_new_type(ctx, &PyLong_Type);


Hm, this is sort of weird. We don't have a symbol for "unboxed int" in the JIT, but it really doesn't feel correct to type this as int. Maybe leave as unknown and we can update our lattice with unboxed C types later? It's not like this information is being used yet, anyways.

Suggested change

index_or_null = sym_new_type(ctx, &PyLong_Type);

index_or_null = sym_new_unknown(ctx);

brandtbucher · 2025-06-05T04:29:31Z

Python/bytecodes.c

                }
-                next = PyStackRef_FromPyObjectSteal(next_o);
+                JUMPBY(oparg + 1);


Can you add back the comment that this is skipping the END_FOR?

Suggested change

JUMPBY(oparg + 1);

// Jump forward by oparg, then skip the following END_FOR:

JUMPBY(oparg + 1);

brandtbucher · 2025-06-05T04:30:20Z

Python/bytecodes.c

                }
+                JUMPBY(oparg + 1);


Suggested change

JUMPBY(oparg + 1);

// Jump forward by oparg, then skip the following END_FOR:

JUMPBY(oparg + 1);

brandtbucher · 2025-06-05T04:42:26Z

Python/ceval.c

+                _PyErr_Clear(tstate);
+            }
+            else {
+                 return PyStackRef_ERROR;


Suggested change

return PyStackRef_ERROR;

return PyStackRef_ERROR;

brandtbucher · 2025-06-05T04:50:21Z

Confirmed that this fixed the pprint benchmarks locally. Just kicked off new benchmarks now.

brandtbucher · 2025-06-05T16:04:05Z

27% slower... ;)

markshannon added 3 commits June 4, 2025 10:56

Fix FOR_ITER for tier 2

eedbe8b

Add test

a10eadb

Update debug stackref

5725821

markshannon requested a review from Fidget-Spinner as a code owner June 4, 2025 14:09

bedevere-app bot added the awaiting core review label Jun 4, 2025

bedevere-app bot mentioned this pull request Jun 4, 2025

Use tagged ints for faster iteration #132554

Open

markshannon requested review from brandtbucher and removed request for Fidget-Spinner June 4, 2025 14:43

markshannon added the skip news label Jun 4, 2025

brandtbucher approved these changes Jun 5, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting core review labels Jun 5, 2025

Address review comments

cc09774

markshannon merged commit b90ecea into python:main Jun 5, 2025
67 checks passed

bedevere-app bot removed the awaiting merge label Jun 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

GH-132554: Fix tier2 `FOR_ITER` implementation and optimizations #135137

GH-132554: Fix tier2 `FOR_ITER` implementation and optimizations #135137

Uh oh!

markshannon commented Jun 4, 2025 •

edited by bedevere-app bot

Loading

Uh oh!

Fidget-Spinner commented Jun 4, 2025

Uh oh!

brandtbucher Jun 5, 2025

Uh oh!

brandtbucher Jun 5, 2025

Uh oh!

brandtbucher Jun 5, 2025

Uh oh!

brandtbucher Jun 5, 2025

Uh oh!

brandtbucher Jun 5, 2025

Uh oh!

brandtbucher commented Jun 5, 2025

Uh oh!

brandtbucher commented Jun 5, 2025

Uh oh!

Uh oh!

Uh oh!

	index_or_null = sym_new_type(ctx, &PyLong_Type);
	index_or_null = sym_new_unknown(ctx);

	JUMPBY(oparg + 1);
	// Jump forward by oparg, then skip the following END_FOR:
	JUMPBY(oparg + 1);

Uh oh!

GH-132554: Fix tier2 FOR_ITER implementation and optimizations #135137

GH-132554: Fix tier2 FOR_ITER implementation and optimizations #135137

Uh oh!

Conversation

markshannon commented Jun 4, 2025 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fidget-Spinner commented Jun 4, 2025

Uh oh!

brandtbucher Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

brandtbucher Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

brandtbucher Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

brandtbucher Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

brandtbucher Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

brandtbucher commented Jun 5, 2025

Uh oh!

brandtbucher commented Jun 5, 2025

Uh oh!

Uh oh!

Uh oh!

GH-132554: Fix tier2 `FOR_ITER` implementation and optimizations #135137

GH-132554: Fix tier2 `FOR_ITER` implementation and optimizations #135137

markshannon commented Jun 4, 2025 •

edited by bedevere-app bot

Loading