gh-132762: Fix underallocation bug in dict.fromkeys() and expand test coverage #133627

angela-tarantula · 2025-05-08T00:32:23Z

Summary

dict_set_fromkeys() was only sizing its new table by the size of the iterable input, ignoring any existing entries in the dictionary. This triggered an infinite loop in dictresize() whenever the new dictionary size was too small to reinsert those entries. This patch adds the same Py_MAX(…, DK_LOG_SIZE(mp->ma_keys)) guard that dict_dict_fromkeys() uses, so we never accidentally shrink the table below its current capacity. The relevant test case baddict3 has been updated to cover this edge case.

For more background, see my proposal.

3 New Regression Tests

Ever since the fast path was updated, the slow path completely lost test coverage. To rectify this, I added 3 new tests:

1 slow-path test when the iterable input is neither a set nor a dictionary
1 slow-path test when fromkeys() is called on a proper subclass of dict, baddict4
1 fast-path test when the input is a set (worth testing explicitly now that dict_dict_fromkeys() and dict_set_fromkeys() are separate)

Thanks for the review! @DinoV @colesbury

Issue: dict_set_fromkeys() calculates size of dictionary improperly #132762

previously covered: - fast path for dictionary inputs - fast path when object's constructor returns non-empty dict (too small for good coverage) now additionally covered: - fast path for set inputs - slow path for non-set, non-dictionary inputs - fast path when object's constructor returns *large* non-empty dict - slow path when object is a proper subclass of dict

python-cla-bot · 2025-05-08T00:32:28Z

All commit authors signed the Contributor License Agreement.

bedevere-app · 2025-05-08T00:32:29Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

colesbury

Thanks, lgtm!

miss-islington-app · 2025-05-08T17:13:16Z

Thanks @angela-tarantula for the PR, and @colesbury for merging it 🌮🎉.. I'm working now to backport this PR to: 3.13, 3.14.
🐍🍒⛏🤖

…h-133627) The function `dict_set_fromkeys()` adds elements of a set to an existing dictionary. The size of the expanded dictionary was estimated with `PySet_GET_SIZE(iterable)`, which did not take into account the size of the existing dictionary. (cherry picked from commit 421ba58) Co-authored-by: Angela Liss <59097311+angela-tarantula@users.noreply.github.com>

miss-islington-app · 2025-05-08T17:13:24Z

Sorry, @angela-tarantula and @colesbury, I could not cleanly backport this to 3.13 due to a conflict.
Please backport using cherry_picker on command line.

cherry_picker 421ba589d02b53131f793889d221ef3b1f1410a4 3.13

bedevere-app · 2025-05-08T17:13:28Z

GH-133685 is a backport of this pull request to the 3.14 branch.

…ythongh-133627) The function `dict_set_fromkeys()` adds elements of a set to an existing dictionary. The size of the expanded dictionary was estimated with `PySet_GET_SIZE(iterable)`, which did not take into account the size of the existing dictionary. (cherry picked from commit 421ba58) Co-authored-by: Angela Liss <59097311+angela-tarantula@users.noreply.github.com>

bedevere-app · 2025-05-08T17:15:38Z

GH-133686 is a backport of this pull request to the 3.13 branch.

) (gh-133685) The function `dict_set_fromkeys()` adds elements of a set to an existing dictionary. The size of the expanded dictionary was estimated with `PySet_GET_SIZE(iterable)`, which did not take into account the size of the existing dictionary. (cherry picked from commit 421ba58) Co-authored-by: Angela Liss <59097311+angela-tarantula@users.noreply.github.com>

) (gh-133686) The function `dict_set_fromkeys()` adds elements of a set to an existing dictionary. The size of the expanded dictionary was estimated with `PySet_GET_SIZE(iterable)`, which did not take into account the size of the existing dictionary. (cherry picked from commit 421ba58) Co-authored-by: Angela Liss <59097311+angela-tarantula@users.noreply.github.com>

…h-133627) The function `dict_set_fromkeys()` adds elements of a set to an existing dictionary. The size of the expanded dictionary was estimated with `PySet_GET_SIZE(iterable)`, which did not take into account the size of the existing dictionary.

angela-tarantula added 3 commits May 7, 2025 20:24

dict_set_fromkeys now properly calculates new_size

7aa4034

modified a fromkeys() test to cover edge case where input is large set

5717052

angela-tarantula requested review from methane and markshannon as code owners May 8, 2025 00:32

bedevere-app bot mentioned this pull request May 8, 2025

dict_set_fromkeys() calculates size of dictionary improperly #132762

Closed

bedevere-app bot added the awaiting review label May 8, 2025

blurb-it bot and others added 2 commits May 8, 2025 13:48

📜🤖 Added by blurb_it.

b13afeb

Merge branch 'main' into fix-issue-132762

049f6f9

colesbury self-requested a review May 8, 2025 17:01

colesbury added needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels May 8, 2025

colesbury approved these changes May 8, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels May 8, 2025

colesbury merged commit 421ba58 into python:main May 8, 2025
47 checks passed

bedevere-app bot removed the awaiting merge label May 8, 2025

miss-islington-app bot assigned colesbury May 8, 2025

bedevere-app bot removed the needs backport to 3.14 bugs and security fixes label May 8, 2025

bedevere-app bot removed the needs backport to 3.13 bugs and security fixes label May 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-132762: Fix underallocation bug in dict.fromkeys() and expand test coverage #133627

gh-132762: Fix underallocation bug in dict.fromkeys() and expand test coverage #133627

Uh oh!

angela-tarantula commented May 8, 2025 •

edited

Loading

Uh oh!

python-cla-bot bot commented May 8, 2025 •

edited

Loading

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

colesbury left a comment

Uh oh!

Uh oh!

miss-islington-app bot commented May 8, 2025

Uh oh!

miss-islington-app bot commented May 8, 2025

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

Uh oh!

Uh oh!

gh-132762: Fix underallocation bug in dict.fromkeys() and expand test coverage #133627

gh-132762: Fix underallocation bug in dict.fromkeys() and expand test coverage #133627

Uh oh!

Conversation

angela-tarantula commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

python-cla-bot bot commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

miss-islington-app bot commented May 8, 2025

Uh oh!

miss-islington-app bot commented May 8, 2025

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

bedevere-app bot commented May 8, 2025

Uh oh!

Uh oh!

angela-tarantula commented May 8, 2025 •

edited

Loading

python-cla-bot bot commented May 8, 2025 •

edited

Loading