gh-64612: Update error handlers list under `open()` #137304

StanFromIreland · 2025-08-01T14:14:58Z

Issue: Errors in documentation of standard codec error handlers #64612

📚 Documentation preview 📚: https://cpython-previews--137304.org.readthedocs.build/

Doc/library/functions.rst

encukou

I'm not convinced the two-column table, with name in the first column and prose in the second, is better than a bulleted list. (Especially as two tables with unaligned columns.)

The "reproduced below for convenience" sounds like the tables should be the same. Perhaps "summarized below for convenience" would be better, with additional details left out?

Doc/library/functions.rst

encukou · 2025-08-06T13:29:41Z

Doc/library/functions.rst

+          when writing data.  This is useful for processing files in an
+          unknown encoding.
+      * - ``'surrogatepass'``
+        - Only available for Unicode codecs.


Aren't these all Unicode codecs?

Suggested change

- Only available for Unicode codecs.

+ Only available for UTF-8, UTF-16 and UTF-32 codecs.

The codecs documentation lists the little/big endian variants, though I think we can be less specific here.

We can, but “Unicode codecs” sounds like a proper term, while I see no definition that would link it to the UTF-{8,16,32} codecs specifically.
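For readers following the thread, here is a minimal sketch of how `'surrogatepass'` behaves across codecs. It assumes CPython's built-in codecs; the sample string and variable names are illustrative and not taken from the patch.

```python
# Illustrative only: a lone surrogate code point, which strict encoding rejects.
lone_surrogate = "\ud800"

# The UTF-8, UTF-16 and UTF-32 codecs accept 'surrogatepass' and write the
# surrogate out as if it were an ordinary code point.
utf8_bytes = lone_surrogate.encode("utf-8", "surrogatepass")       # b'\xed\xa0\x80'
utf16_bytes = lone_surrogate.encode("utf-16-le", "surrogatepass")
utf32_bytes = lone_surrogate.encode("utf-32-le", "surrogatepass")

# Decoding with the same handler round-trips the surrogate.
decoded = utf8_bytes.decode("utf-8", "surrogatepass")

# With a codec outside that family, the handler does not apply and the
# original encoding error is raised.
try:
    lone_surrogate.encode("latin-1", "surrogatepass")
    latin1_supported = True
except UnicodeEncodeError:
    latin1_supported = False
```

This is the behaviour that motivates naming the UTF-8, UTF-16 and UTF-32 codecs explicitly rather than using the looser term "Unicode codecs".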

Doc/library/functions.rst

StanFromIreland · 2025-08-06T14:06:55Z

I'm not convinced the two-column table, with name in the first column and prose in the second, is better than a bulleted list

I see. It is a table in the codecs docs, so in an attempt to make them consistent I converted it to a table (albeit a more convenient form of one).

encukou

With 3-5 lines per entry, it could just as well be a copy of the original table.
What about making the summary super brief, something like:

strict: raise UnicodeError
ignore: omit malformed data
replace: replace with ? or �
backslashreplace: replace with \xhh, \uhhhh, or \Uhhhhhhhh

and so on?
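As a rough illustration of what those one-line summaries describe, the sketch below runs each of the four handlers on a small sample. The sample text and variable names are made up for this example; only the handler names come from the discussion.

```python
# Illustrative sample: two characters that ASCII cannot represent.
text = "café €"

# strict: raise UnicodeError (here, its subclass UnicodeEncodeError).
try:
    text.encode("ascii", "strict")
    strict_result = "no error"
except UnicodeError as exc:
    strict_result = type(exc).__name__

# ignore: omit the data that cannot be encoded.
ignored = text.encode("ascii", "ignore")

# replace: substitute '?' when encoding, U+FFFD when decoding.
replaced = text.encode("ascii", "replace")
decoded = b"caf\xe9".decode("utf-8", "replace")

# backslashreplace: substitute \xhh, \uhhhh, or \Uhhhhhhhh escapes.
escaped = text.encode("ascii", "backslashreplace")

print(strict_result, ignored, replaced, escaped, ascii(decoded))
```

The same handler names can be passed as the `errors` argument of `open()`, which is the context of this change.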

encukou · 2025-08-11T12:42:58Z

Doc/library/functions.rst

+          when writing data.  This is useful for processing files in an
+          unknown encoding.
+      * - ``'surrogatepass'``
+        - Only available for Unicode codecs.


We can, but “Unicode codecs” sounds like a proper term, while I see no definition that would link it to the UTF-{8,16,32} codecs specifically.

StanFromIreland · 2025-08-11T13:02:40Z

Counter-proposal: why bother with super short summaries when we can just link straight to the main table?

encukou · 2025-08-11T13:19:29Z

That sounds good, too!

Commit

6479dce

StanFromIreland requested review from ncoghlan and malemburg August 1, 2025 14:14

bedevere-app bot added the labels "awaiting review", "docs" (Documentation in the Doc dir), and "skip news" Aug 1, 2025

github-project-automation bot added this to Docs PRs Aug 1, 2025

github-project-automation bot moved this to Todo in Docs PRs Aug 1, 2025

bedevere-app bot mentioned this pull request Aug 1, 2025

Errors in documentation of standard codec error handlers #64612

Open

aisk reviewed Aug 2, 2025

encukou reviewed Aug 6, 2025

Petr's suggestions

fd1b26e

StanFromIreland requested a review from encukou August 6, 2025 15:24

encukou reviewed Aug 11, 2025
