Changes by Kang-Hao (Kenny) Lu :
--
nosy: +kennyluck
___
Python tracker
<http://bugs.python.org/issue12100>
___
___
Python-bugs-list mailing list
Unsubscribe:
Kang-Hao (Kenny) Lu added the comment:
> and the table on the documentation of the codec module suggests *utf_8*
> as the name of the codec, which I believe to be equivalent to "utf_8"
> because '-' is not a valid character of an identifier.
typo: equivalent to &
Kang-Hao (Kenny) Lu added the comment:
> The followings are on my TODO list, although this patch doesn't depend
> on any of these and can be reviewed and landed separately:
> * make the surrogatepass error handler work for utf-16 and utf-32. (I
>should be able to fin
New submission from Kang-Hao (Kenny) Lu :
Currently the "surrogatepass" handler always encodes the surrogates in UTF-8
and hence the behavior for, say, "\udc80".encode("latin-1",
"surrogatepass").decode("latin-1") might be unexpected and
Changes by Kang-Hao (Kenny) Lu :
--
type: -> behavior
___
Python tracker
<http://bugs.python.org/issue13913>
___
___
Python-bugs-list mailing list
Unsubscri
New submission from Kang-Hao (Kenny) Lu :
Since Python 3.2.2 (I don't have earlier version to test with),
>>> "\udc80".encode("utf-8")
UnicodeEncodeError: *utf-8* codec can't encode character '\udc80'...
but
>>> b"\xff".dec
Changes by Kang-Hao (Kenny) Lu :
--
nosy: +kennyluck
___
Python tracker
<http://bugs.python.org/issue7856>
___
___
Python-bugs-list mailing list
Unsubscribe:
Kang-Hao (Kenny) Lu added the comment:
Attached patch does the following beyond what the patch from haypo does:
* call the error handler
* reject 0xd800~0xdfff when decoding utf-32
The followings are on my TODO list, although this patch doesn't depend on any
of these and can be rev