Re: Unicode width data inconsistent/outdated

2017-08-08 Thread Corinna Vinschen
On Aug 7 21:27, Thomas Wolff wrote: > On 07.08.2017 at 11:28, Corinna Vinschen wrote: > > On Aug 5 21:06, Thomas Wolff wrote: > > > I have a working version now, and it uses much less as the category table is range-based. Another table is needed for case conversion. Size estim

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Thomas Wolff
On 07.08.2017 at 23:29, Brian Inglis wrote: On 2017-08-07 13:30, Thomas Wolff wrote: On 07.08.2017 at 21:07, Brian Inglis wrote: Implementation considerations for handling the Unicode tables described in http://www.unicode.org/versions/Unicode10.0.0/ch05.pdf and implemented in https

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Brian Inglis
On 2017-08-07 13:30, Thomas Wolff wrote: > On 07.08.2017 at 21:07, Brian Inglis wrote: >> Implementation considerations for handling the Unicode tables described in >> http://www.unicode.org/versions/Unicode10.0.0/ch05.pdf >> and implemented in >> https://www.strchr.com/multi-stage_tables

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Thomas Wolff
Hi Brian, On 07.08.2017 at 21:07, Brian Inglis wrote: ... Implementation considerations for handling the Unicode tables described in http://www.unicode.org/versions/Unicode10.0.0/ch05.pdf and implemented in https://www.strchr.com/multi-stage_tables ICU icu4[cj] uses a folded tr

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Thomas Wolff
On 07.08.2017 at 11:28, Corinna Vinschen wrote: On Aug 5 21:06, Thomas Wolff wrote: On 04.08.2017 at 19:01, Corinna Vinschen wrote: On Aug 3 21:44, Thomas Wolff wrote: My attempt would be to base the functions on a common table of character categories instead. ...Keep in mind that the ta

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Brian Inglis
On 2017-08-07 03:28, Corinna Vinschen wrote: > On Aug 5 21:06, Thomas Wolff wrote: >> On 04.08.2017 at 19:01, Corinna Vinschen wrote: >>> On Aug 3 21:44, Thomas Wolff wrote: My attempt would be to base the functions on a common table of character categories instead. >>> Keep in mind t

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Corinna Vinschen
On Aug 7 11:28, Corinna Vinschen wrote: > On Aug 5 21:06, Thomas Wolff wrote: > > On 04.08.2017 at 19:01, Corinna Vinschen wrote: > > > This shouldn't matter to you, just keep it in place. It's a historical, > > > low footprint conversion for Japanese characters without pulling in the > > > uni

Re: Unicode width data inconsistent/outdated

2017-08-07 Thread Corinna Vinschen
On Aug 5 21:06, Thomas Wolff wrote: > On 04.08.2017 at 19:01, Corinna Vinschen wrote: > > On Aug 3 21:44, Thomas Wolff wrote: > > > My attempt would be to base the functions on a common table of character > > > categories instead. > > Keep in mind that the table is not loaded into memory on dem

Re: Unicode width data inconsistent/outdated

2017-08-05 Thread Thomas Wolff
On 05.08.2017 at 22:24, Brian Inglis wrote: On 2017-08-05 13:06, Thomas Wolff wrote: ... Which other platforms do actually use newlib? Many historical uPs and current uCs used in embedded systems supporting gcc not using Linux, including RTEMS, devKits for Nintendo and Sony game systems, some

Re: Unicode width data inconsistent/outdated

2017-08-05 Thread Brian Inglis
On 2017-08-05 13:06, Thomas Wolff wrote: > On 04.08.2017 at 19:01, Corinna Vinschen wrote: >> On Aug 3 21:44, Thomas Wolff wrote: >>> On 28.07.2017 at 21:58, Corinna Vinschen wrote: On Jul 26 23:43, Thomas Wolff wrote: > On 26.07.2017 at 11:50, Corinna Vinschen wrote: >> On Jul 26

Re: Unicode width data inconsistent/outdated

2017-08-05 Thread Thomas Wolff
On 04.08.2017 at 19:01, Corinna Vinschen wrote: On Aug 3 21:44, Thomas Wolff wrote: On 28.07.2017 at 21:58, Corinna Vinschen wrote: On Jul 26 23:43, Thomas Wolff wrote: On 26.07.2017 at 11:50, Corinna Vinschen wrote: On Jul 26 03:16, Yaakov Selkowitz wrote: On 2017-07-26 03:08, Corinna V

Re: Unicode width data inconsistent/outdated

2017-08-04 Thread Corinna Vinschen
On Aug 3 21:44, Thomas Wolff wrote: > On 28.07.2017 at 21:58, Corinna Vinschen wrote: > > On Jul 26 23:43, Thomas Wolff wrote: > > > On 26.07.2017 at 11:50, Corinna Vinschen wrote: > > > > On Jul 26 03:16, Yaakov Selkowitz wrote: > > > > > On 2017-07-26 03:08, Corinna Vinschen wrote: > > > > > >

Re: Unicode width data inconsistent/outdated

2017-08-03 Thread Thomas Wolff
On 28.07.2017 at 21:58, Corinna Vinschen wrote: On Jul 26 23:43, Thomas Wolff wrote: On 26.07.2017 at 11:50, Corinna Vinschen wrote: On Jul 26 03:16, Yaakov Selkowitz wrote: On 2017-07-26 03:08, Corinna Vinschen wrote: On Jul 26 08:49, Thomas Wolff wrote: It would be good to keep wcwidth/w

Re: Unicode width data inconsistent/outdated

2017-07-28 Thread Corinna Vinschen
On Jul 26 23:43, Thomas Wolff wrote: > On 26.07.2017 at 11:50, Corinna Vinschen wrote: > > On Jul 26 03:16, Yaakov Selkowitz wrote: > > > On 2017-07-26 03:08, Corinna Vinschen wrote: > > > > On Jul 26 08:49, Thomas Wolff wrote: > > > > > It would be good to keep wcwidth/wcswidth in sync with the i

Re: Unicode width data inconsistent/outdated

2017-07-26 Thread Thomas Wolff
On 26.07.2017 at 11:50, Corinna Vinschen wrote: On Jul 26 03:16, Yaakov Selkowitz wrote: On 2017-07-26 03:08, Corinna Vinschen wrote: On Jul 26 08:49, Thomas Wolff wrote: It would be good to keep wcwidth/wcswidth in sync with the installed Unicode data version (package unicode-ucd). Currently

Re: Unicode width data inconsistent/outdated

2017-07-26 Thread Corinna Vinschen
On Jul 26 03:16, Yaakov Selkowitz wrote: > On 2017-07-26 03:08, Corinna Vinschen wrote: > > On Jul 26 08:49, Thomas Wolff wrote: > >> It would be good to keep wcwidth/wcswidth in sync with the installed > >> Unicode data version (package unicode-ucd). > >> Currently it seems to be hard-coded (in ne

Re: Unicode width data inconsistent/outdated

2017-07-26 Thread Yaakov Selkowitz
On 2017-07-26 03:08, Corinna Vinschen wrote: > On Jul 26 08:49, Thomas Wolff wrote: >> It would be good to keep wcwidth/wcswidth in sync with the installed >> Unicode data version (package unicode-ucd). >> Currently it seems to be hard-coded (in newlib/libc/string/wcwidth.c); >> it refers to Unicod

Re: Unicode width data inconsistent/outdated

2017-07-26 Thread Corinna Vinschen
On Jul 26 08:49, Thomas Wolff wrote: > It would be good to keep wcwidth/wcswidth in sync with the installed > Unicode data version (package unicode-ucd). > Currently it seems to be hard-coded (in newlib/libc/string/wcwidth.c); > it refers to Unicode 5.0 while installed Unicode data suggest 9.0 woul

Unicode width data inconsistent/outdated

2017-07-25 Thread Thomas Wolff
It would be good to keep wcwidth/wcswidth in sync with the installed Unicode data version (package unicode-ucd). Currently it seems to be hard-coded (in newlib/libc/string/wcwidth.c); it refers to Unicode 5.0 while installed Unicode data suggest 9.0 would be used. I can provide some scripts to gen