bpo-34222: Lib/email: Fix infinite loop when folding #8990

Xiami2012 · 2018-08-29T11:06:50Z

Currently when folding headers with length > maxlen, _fold_as_ew tries
to split the to_encode into multiple parts to fulfill the maxlen limit,
in an inapropriate way.

If a long header has non-ascii characters, in some situations (e.g. a
Subject: with full of CJK chars), it will split the to_encode into
["", to_encode], entering an infinite loop.

This commit fixes this by introduce a smarter way to split.
Besides, when an header needs to be folded now, every non-last line will
try its best to reach the maxlen, in O(log N) time.
Also, apply missing charset= parameter for _ew.encode.

The bug is introduced in commit 85d5c18

https://bugs.python.org/issue34222

the-knights-who-say-ni · 2018-08-29T11:06:53Z

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA).

Unfortunately we couldn't find an account corresponding to your GitHub username on bugs.python.org (b.p.o) to verify you have signed the CLA (this might be simply due to a missing "GitHub Name" entry in your b.p.o account settings). This is necessary for legal reasons before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

You can check yourself to see if the CLA has been received.

Thanks again for your contribution, we look forward to reviewing it!

Currently when folding headers with length > maxlen, _fold_as_ew tries to split the to_encode into multiple parts to fulfill the maxlen limit, in an inapropriate way. If a long header has non-ascii characters, in some situations (e.g. a Subject: with full of CJK chars), it will split the to_encode into ["", to_encode], entering an infinite loop. This commit fixes this by introducing a smarter way to split. Besides, when an header needs to be folded now, every non-last line will try its best to reach the maxlen, in O(log N) time. Also, apply missing charset= parameter for _ew.encode. The bug is introduced in commit 85d5c18

georgschoelly · 2019-02-05T08:34:38Z

Lib/email/_header_value_parser.py

+        if len(ew) > remaining_space:
+            # Find the longest first_part
+            # since len(_ew.encode(to_encode[:x])) is a non-linear
+            # monotonically increasing function, and calculating the


_ew.encode is biased towards the 'q' encoding. This might violate the assumption of a monotonically increasing function for some corner cases. (This was already the case for the old code.)

I hope to find the time to write a test case for this.

csabella · 2019-06-05T16:16:39Z

Thank you for the contribution. This was fixed in GH-12020, so I'm closing this as a duplicate.

Xiami2012 requested a review from a team as a code owner August 29, 2018 11:06

the-knights-who-say-ni added the CLA not signed label Aug 29, 2018

bedevere-bot added the awaiting review label Aug 29, 2018

the-knights-who-say-ni added CLA signed and removed CLA not signed labels Aug 30, 2018

Xiami2012 added 2 commits August 30, 2018 11:18

Lib/test/test_email: Fix tests

064a5f2

georgschoelly reviewed Feb 5, 2019

View reviewed changes

csabella closed this Jun 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-34222: Lib/email: Fix infinite loop when folding #8990

bpo-34222: Lib/email: Fix infinite loop when folding #8990

Uh oh!

Xiami2012 commented Aug 29, 2018 •

edited by bedevere-bot

Loading

Uh oh!

the-knights-who-say-ni commented Aug 29, 2018

Uh oh!

georgschoelly Feb 5, 2019

Uh oh!

csabella commented Jun 5, 2019

Uh oh!

Uh oh!

Uh oh!

bpo-34222: Lib/email: Fix infinite loop when folding #8990

bpo-34222: Lib/email: Fix infinite loop when folding #8990

Uh oh!

Conversation

Xiami2012 commented Aug 29, 2018 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

the-knights-who-say-ni commented Aug 29, 2018

Uh oh!

georgschoelly Feb 5, 2019

Choose a reason for hiding this comment

Uh oh!

csabella commented Jun 5, 2019

Uh oh!

Uh oh!

Xiami2012 commented Aug 29, 2018 •

edited by bedevere-bot

Loading