Re: Minor utf32-to-utf8 bug

2019-11-11 Thread Chet Ramey
On 11/10/19 9:07 AM, István Pásztor wrote: > Hi > > The encoding of six bytes long utf-8 sequences is buggy. Today unicode > requires at most 4 bytes long utf-8 sequences but if we handle 5 and 6 too > then let's do it the right way. Thanks for the report and fix. Chet -- ``The lyf so short, t

Minor utf32-to-utf8 bug

2019-11-10 Thread István Pásztor
Hi The encoding of six bytes long utf-8 sequences is buggy. Today unicode requires at most 4 bytes long utf-8 sequences but if we handle 5 and 6 too then let's do it the right way. The attached patch was created using a fresh master clone (d894cfd104086ddf68c286e67a5fb2e02eb43b7b). I'm writing a