string.Formatter.parse does not handle auto-numbered positional fields #89867

SDesch · 2021-11-03T11:55:48Z

BPO	45704
Nosy	@ericvsmith

Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.


GitHub fields:

assignee = None
closed_at = None
created_at = <Date 2021-11-03.11:55:47.784>
labels = ['type-bug', '3.8', '3.9', '3.10', '3.11', '3.7']
title = 'string.Formatter.parse does not handle auto-numbered positional fields'
updated_at = <Date 2021-11-05.18:06:11.745>
user = 'https://bugs.python.org/SDesch'

bugs.python.org fields:

activity = <Date 2021-11-05.18:06:11.745>
actor = 'eric.smith'
assignee = 'none'
closed = False
closed_date = None
closer = None
components = []
creation = <Date 2021-11-03.11:55:47.784>
creator = 'SDesch'
dependencies = []
files = []
hgrepos = []
issue_num = 45704
keywords = []
message_count = 9.0
messages = ['405610', '405619', '405624', '405627', '405636', '405757', '405796', '405802', '405814']
nosy_count = 2.0
nosy_names = ['eric.smith', 'SDesch']
pr_nums = []
priority = 'normal'
resolution = None
stage = None
status = 'open'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue45704'
versions = ['Python 3.6', 'Python 3.7', 'Python 3.8', 'Python 3.9', 'Python 3.10', 'Python 3.11']

Linked PRs

gh-89867: string.Formatter auto numbering doc updates #129617

SDesch · 2021-11-03T11:55:48Z

It appears that when auto-numbered positional fields were added in Python 3.1, Formatter.parse was not updated to handle them. It currently returns an empty string as the field name.

list(Formatter().parse('hello {}'))  # [('hello ', '', '', None)]

This does not align with Formatter.get_field, which according to the docs does the following: "Given field_name as returned by parse() (see above), convert it to an object to be formatted."

Supplying an empty string to .get_field() raises a KeyError:

Formatter().get_field("", [1, 2, 3], {})  # raises KeyError
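For anyone who wants to reproduce this, here is a minimal sketch of the mismatch described above. It only uses `Formatter.parse` and `Formatter.get_field` from the standard library; the sample arguments are arbitrary.

```python
from string import Formatter

fmt = Formatter()

# parse() reports the field name exactly as written; for '{}' that is ''.
parsed = list(fmt.parse('hello {}'))
print(parsed)  # [('hello ', '', '', None)]

# Passing that empty name straight to get_field() fails, because ''
# is not a digit string and is therefore looked up as a keyword argument.
raised = False
try:
    fmt.get_field('', [1, 2, 3], {})
except KeyError:
    raised = True
print('KeyError raised:', raised)

# An explicit index works: get_field() returns (object, key_used).
obj, key_used = fmt.get_field('0', [1, 2, 3], {})
print(obj)  # 1
```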

ericvsmith · 2021-11-03T14:00:28Z

For reference, the documentation is at https://docs.python.org/3/library/string.html#custom-string-formatting

I guess in your example it should return:
[('hello ', '0', '', None)]

SDesch · 2021-11-03T15:39:47Z

Yes, it should return a string containing the index of the positional argument, i.e. "0", so that it is compatible with .get_field(). Side note: It's somewhat weird that .get_field expects a string while .get_value expects an int for positional arguments.

ericvsmith · 2021-11-03T16:12:53Z

> Side note: It's somewhat weird that .get_field expects a string while .get_value expects an int for positional arguments.

.parse is just concerned with parsing, so it works on and returns strings. .get_field takes strings because it is the thing that's trying to determine whether or not a field name looks like an integer. At least that's how I remember it.
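The split of responsibilities described here can be seen with a short sketch. The argument values are made up for illustration; the calls are the documented `get_field` and `get_value` methods.

```python
from string import Formatter

fmt = Formatter()
args, kwargs = ('a', 'b'), {'name': 'c'}

# get_field() receives the raw string and decides whether it is an index.
obj, used_key = fmt.get_field('1', args, kwargs)

# A non-numeric name is treated as a keyword lookup instead.
named_obj, named_key = fmt.get_field('name', args, kwargs)

# get_value() is the lower-level hook: int for positional, str for keyword.
direct = fmt.get_value(1, args, kwargs)

print(obj, used_key, named_obj, named_key, direct)
```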

SDesch · 2021-11-03T18:00:38Z

Another thing that occurred to me is the question of what .parse() should do when a mix of auto-numbered and manually numbered fields is supplied, e.g. {}{1}. As of now, .parse() happily processes such inputs, and some other piece of code deals with this and ultimately raises an exception saying that mixing manual with automatic numbering is not allowed. If .parse() supported automatic numbering, it would have to be aware of this too, I guess?
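A small sketch of the current behaviour for the mixed case, as far as I can tell: `.parse()` accepts `'{}{1}'` without complaint, and the `ValueError` only shows up when the string is actually formatted. The exact wording of the message is not asserted here, since it may differ between versions.

```python
from string import Formatter

fmt = Formatter()

# parse() only splits the string, so the mix is accepted at this stage.
fields = [name for _, name, _, _ in fmt.parse('{}{1}')]
print(fields)  # ['', '1']

# The mixing check happens later, during formatting.
message = None
try:
    fmt.format('{}{1}', 'a', 'b')
except ValueError as exc:
    message = str(exc)
print(message)
```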

ericvsmith · 2021-11-04T22:32:38Z

The more I think about this, the more I think it's not .parse's job to fill in the field numbers, it's the job of whoever is calling it.

Just as it's not .parse's job to give you an error if you switch back and forth between numbered and un-numbered fields.

It's literally just telling you what's in the string as it breaks it apart, not assigning any further meaning to the parts. I guess I should have called it .lex, not .parse.

SDesch · 2021-11-05T13:31:48Z

That definition of .parse() definitely makes sense. Do you then think this is out of scope for Formatter in general, or just for .parse()? Just for reference, this is what I currently use to get automatic numbering to work for my use case.

from string import Formatter

def parse_command_template(format_string):
    """Like Formatter.parse, but fill in indices for auto-numbered fields."""

    auto_numbering_error = ValueError(
        'cannot switch from automatic field numbering to manual field specification')

    index = 0
    # None until the first positional field is seen, then True (auto) or False (manual).
    auto_numbering = None

    for literal_text, field_name, spec, conversion in Formatter().parse(format_string):
        if field_name is not None:
            # Manually numbered field such as {0}.
            if field_name.isdigit():
                if auto_numbering is True:
                    raise auto_numbering_error
                auto_numbering = False

            # Auto-numbered field {}: substitute the next index.
            if field_name == '':
                if auto_numbering is False:
                    raise auto_numbering_error
                auto_numbering = True
                field_name = str(index)
                index += 1

        yield literal_text, field_name, spec, conversion

ericvsmith · 2021-11-05T15:04:36Z

I think your code is reasonable. But since string.Formatter gets so little use, I'm not sure it's worth adding this to the stdlib. On the other hand, it could be used internally by string.Formatter.

We'd need to pick a better name, though. And maybe it should return the field_name as an int.

ericvsmith · 2021-11-05T18:06:12Z

That is, return field_name as an int if it's an int, otherwise as a string.
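To make the int-or-string idea concrete, here is a rough sketch. `parse_typed` is a hypothetical name, not an existing or proposed stdlib function, and it only converts digit-only names; compound names such as `0.attr` are left as strings.

```python
from string import Formatter

def parse_typed(format_string):
    """Hypothetical helper: like Formatter.parse, but digit-only names become ints."""
    for literal, name, spec, conversion in Formatter().parse(format_string):
        if name is not None and name.isdigit():
            name = int(name)
        yield literal, name, spec, conversion

result = list(parse_typed('{0} and {key!r}'))
print(result)  # [('', 0, '', None), (' and ', 'key', '', 'r')]
```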

dg-pb · 2025-01-31T13:32:29Z

> The more I think about this, the more I think it's not .parse's job to fill in the field numbers, it's the job of whoever is calling it.

I agree with this. .parse correctly returns an empty string for an empty field name. This is well factored, providing a one-to-one relationship between input and output.

However, the main issue is that the suggested change cannot work for nested fields.

When parsing only one level, the suggested change to parse would return correct results. E.g.:

list(string.Formatter().parse('{}{}'))
[('', '0', '', None), ('', '1', '', None)]

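However, it would not return correct results for a nested field, because parse does not descend into the format spec and so never counts the inner {}: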

list(string.Formatter().parse('{:.{}f}{}'))
[('', '0', '.{}f', None), ('', '1', '', None)]

The equivalent format string with manually numbered fields would be:

fmt = '{0:.{1}f}{2}'
# And correct parse output should be:
list(string.Formatter().parse('{:.{}f}{}'))
[('', '0', '.{}f', None), ('', '2', '', None)]

Auto-numbering is very easy to do outside of .parse, especially for the flat case. I suggest closing this as "Not planned".
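As a rough illustration of doing the numbering outside `.parse`, including the nested case, something along these lines seems to work. `number_fields` is a hypothetical helper written for this comment, not part of `string.Formatter`; it recurses into the format spec and does not check for mixing manual and automatic numbering.

```python
from string import Formatter

def number_fields(format_string, _counter=None):
    """Hypothetical helper: rewrite '{}' fields with explicit indices,
    including fields nested inside a format spec."""
    counter = _counter if _counter is not None else [0]
    out = []
    for literal, name, spec, conversion in Formatter().parse(format_string):
        # parse() unescapes '{{' and '}}', so escape them again on output.
        out.append(literal.replace('{', '{{').replace('}', '}}'))
        if name is None:
            continue  # trailing literal text with no field
        if name == '':
            name = str(counter[0])
            counter[0] += 1
        # Number the outer field first, then any fields nested in its spec.
        spec = number_fields(spec, counter) if spec else ''
        field = name
        if conversion:
            field += '!' + conversion
        if spec:
            field += ':' + spec
        out.append('{' + field + '}')
    return ''.join(out)

numbered = number_fields('{:.{}f}{}')
print(numbered)  # {0:.{1}f}{2}
```

The numbering order matches what `str.format` does for the same input, so `numbered.format(3.14159, 2, 'x')` and `'{:.{}f}{}'.format(3.14159, 2, 'x')` give the same result.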

encukou · 2025-02-03T12:50:57Z

That's reasonable! Looks like all that's left is to mention this in the parse/get_field docs.


SDesch (mannequin) added the labels 3.7 (EOL), 3.8 (EOL), 3.9 (only security fixes), 3.10 (only security fixes), 3.11 (only security fixes), and type-bug (an unexpected behavior, bug, or error) on Nov 3, 2021

ezio-melotti transferred this issue from another repository Apr 10, 2022

iritkatriel added the interpreter-core (Objects, Python, Grammar, and Parser dirs) label Nov 28, 2023

This was referenced Jan 25, 2025

string.Formatter does not handle non-indexed item and attribute access #129273

Closed

gh-71494: string.Formatter unnumbered key/attributes #21767

Merged

picnixz removed the labels interpreter-core (Objects, Python, Grammar, and Parser dirs), 3.11 (only security fixes), and 3.10 (only security fixes) on Feb 3, 2025

picnixz added the label stdlib (Python modules in the Lib dir) and removed the labels 3.9 (only security fixes), 3.8 (EOL), and 3.7 (EOL) on Feb 3, 2025

bedevere-app bot mentioned this issue Feb 3, 2025

gh-89867: string.Formatter auto numbering doc updates #129617

Merged

encukou pushed a commit that referenced this issue Apr 30, 2025

gh-89867: string.Formatter auto numbering doc updates (GH-129617)

a4b7128

mth4saurabh pushed a commit to mth4saurabh/cpython that referenced this issue Apr 30, 2025

pythongh-89867: string.Formatter auto numbering doc updates (pythonGH…

b9c5098

…-129617)
