[issue43703] xml.etree parser does not accept valid control characters

2021-04-02 Thread Romuald Brunet
Romuald Brunet added the comment: Thanks for the quick reply We're getting data from about a hundred different providers around the world; some of them not really keen on standards, so we already have some hacks to fix invalid XML. We'll add one to the list In that particular case, the XML

[issue43703] xml.etree parser does not accept valid control characters

2021-04-02 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: It is a known issue, see issue11804 and issue39512. In short, the underlying library for XML parsing (expat) does not support XML 1.1 and does not have plans to support it. And seems that XML 1.1 is a dead standard if it is not supported in popular parsing

[issue43703] xml.etree parser does not accept valid control characters

2021-04-02 Thread Karthikeyan Singaravelan
Change by Karthikeyan Singaravelan : -- nosy: +eli.bendersky, scoder, serhiy.storchaka ___ Python tracker ___ ___ Python-bugs-list m

[issue43703] xml.etree parser does not accept valid control characters

2021-04-02 Thread Romuald Brunet
New submission from Romuald Brunet : Python XML parser (xml.etree) does not seems to allow control characters that are invalid in XML 1.0, but valid in XML 1.1 [1] [2] Considering the following sample: import xml.etree.ElementTree as ET bad = 'bar baz' print(ET.fromstring(bad)) The pars