[Python-Dev] Re: An f-string issue [Was: Re: Re: What to do about invalid escape sequences]

Eric V. Smith Sat, 10 Aug 2019 17:26:44 -0700

n 8/10/2019 7:46 PM, Glenn Linderman wrote:

Because of the "invalid escape sequence" and "raw string" discussion,when looking at the documentation, I also noticed the followingdescription for f-strings:
Escape sequences are decoded like in ordinary string literals (exceptwhen a literal is also marked as a raw string). After decoding, thegrammar for the contents of the string is:
followed by lots of stuff, followed by
Backslashes are not allowed in format expressions and will raise anerror:
f"newline: {ord('\n')}"   # raises SyntaxError
What I don't understand is how, if f-strings are processed ASDESCRIBED, how the \n is ever seen by the format expression.

If I recall correctly, the mentioned decoding is happening on the stringliteral parts of the f-strings (above, the "newline: " part), not theexpression parts (inside the {}). But it's been a while and I don'trecall all of the details.

The description is that they are first decoded like ordinary strings,and then parsed for the internal grammar containing {} expressions tobe expanded. If that were true, the \n in the above example wouldalready be a newline character, and the parsing of the formatexpression would not see the backslash. And if it were true, thatwould actually be far more useful for this situation.
So given that it is not true, why not? And why go to the extra work ofprohibiting \ in the format expressions?

It's a future-proofing thing. See the discussion athttps://mail.python.org/archives/list/python-dev@python.org/thread/EVXD72IYUN2APF2443OMADKA5WJTOKHD/It has pointers to other parts of the discussion.

At some point, I'm planning on switching the parsing of f-strings fromthe custom parser (see Python/ast.c, FstringParser_ConcatFstring()) tohaving the python parser itself parse the f-strings. This will besimilar to PEP 536, which doesn't have much detail, but does describesome of the motivations.

The PEP 498, of course, has an apparently more accurate description,that the {} parsing actually happens before the escape processing.Perhaps this avoids making multiple passes over the string to do thework, as the literal pieces and format expression pieces have to beseparate in the generated code, but that is just my speculation: I'dlike to know the real reason.
Should the documentation be fixed to make the description moreaccurate? If so, I'd be glad to open an issue.

Sure. I'm always in favor of accuracy. The f-string documentation was alast-minute rush job that could have used a lot more editing, and moreeyes are always welcome.

But it will take a fair amount of research to understand it well enoughto document it in more detail.

The PEP further contains the inaccurate statement:
Like all raw strings in Python, no escape processing is done for rawf-strings:
not mentioning the actual escape processing that is done for rawstrings, regarding \" and \'.


It should probably just say it uses the same rules as raw strings.

Eric

_______________________________________________
Python-Dev mailing list -- python-dev@python.org
To unsubscribe send an email to python-dev-le...@python.org
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at 
https://mail.python.org/archives/list/python-dev@python.org/message/FKNEBB5HTMRX4RWLPTZN5K2WRZ5W7MI6/

[Python-Dev] Re: An f-string issue [Was: Re: Re: What to do about invalid escape sequences]

Reply via email to