Ezio Melotti added the comment:
> If the problem is that PyUnicode_FromUnicode() rejects character
> outside range [U+0000; U+10ffff],
But this used to return two valid characters:
>>> str(array('u', b'asdf'))
"array('u', '獡晤')"
so I think it still should -- unless the operation was already nonsensical
and/or there's no way to do the same thing on 3.3+ due to the change introduced
by PEP 393.
> it would be better to use the byte string '\xff' * sizeof_PY_UNICODE.
What for?
> U+66647361 may become valid in a future version of Unicode,
It won't.
----------
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue17223>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com