Hi Jim,

Among the news for grep 3.12:

>   On Windows platforms and on AIX in 32-bit mode, grep now supports
>   Unicode characters outside the Basic Multilingual Plane.

I wouldn't claim this, since the support is not yet complete. To make
it complete, the Gnulib 'regex' module needs to switch from 'wchar_t'
to 'char32_t' as well. I have started work on this, but it's not
ready for consumption yet, since I plan(ned) to strengthen the unit
tests of the 'regex' module first, and that is a major undertaking.

But maybe I should change the plan: instead of strengthening the unit
tests inside Gnulib, just make the change and use grep's tests on Cygwin
to verify the result. (On glibc platforms, where wchar_t and char32_t
have the same size and the same semantics, the changes will be a no-op
anyway.)

With that changed plan, the stated improvement could be ready in
two weeks. Would you agree with this approach? Would you be willing
to delay the grep-3.12 release by two weeks to include the completion
of this feature?

Bruno



