Problem inserting Unicode heavy minus sign (U+2796)

2013-06-27 Thread Linda Walsh
Have an oddity in bash. Tried to insert the heavy minus sign and I get deterministic garbage: echo "^K$"|hexdump 000 9ee2 0b01 0a24 vs. if I use 'cat', or just raw input to hexdump: hexdump ➖000 9ee2 0096 003 (that minus sign before the first lin

Re: locale specific ordering in EN_US vs. characterset collation rules for UTF-8

2013-06-27 Thread Linda Walsh
Chet Ramey wrote: The world is larger than glibc and the glibc locale definitions. We need a solution that encompasses all of it. That solution should, and maybe will, include glibc, but that is not sufficient by itself. I don't suppose it is possible to use the Unicode collatio

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Eli Zaretskii
> Date: Thu, 27 Jun 2013 15:27:40 +0200 > From: Paolo Bonzini > Cc: Aharon Robbins , bug-bash@gnu.org, bug-...@gnu.org > > I would suggest distros to rip out the #else part of this #ifndef. Right, and then this will really be a dark, sad world, since Gawk is currently the only ray of light in th

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Chet Ramey
On 6/27/13 10:54 AM, Paolo Bonzini wrote: > > Fixing gnulib ensures that, on non-GNU systems, everyone will get the > same result from gnulib users (BTW, note that grep's fix for the problem > did not introduce a particular policy: instead, it just relied on > gnulib's fix). It's the best you ca

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Chet Ramey
On 6/27/13 8:11 AM, Johannes Meixner wrote: > I do not care too much which kind of locale specific ordering > or collating or regex behaviour is actually implemented > as long as it works consistently in grep, gawk, sed, bash,... This is Arnold's goal. And I would add to it that we need the diff

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Chet Ramey
On 6/27/13 4:48 AM, Paolo Bonzini wrote: > Il 27/06/2013 09:33, Aharon Robbins ha scritto: >> Hi Paolo. >> I still believe that there is no place other than the glibc locale descriptions where this can be fixed. >> This is necessary but not sufficient. All of gawk, grep, sed and bash >> r

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Chet Ramey
On 6/27/13 3:33 AM, Aharon Robbins wrote: > Hi Paolo. > >> I still believe that there is no place other than the glibc locale >> descriptions where this can be fixed. > > This is necessary but not sufficient. All of gawk, grep, sed and bash > run on lots of non-GLIBC systems. The locale definiti

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Johannes Meixner
Hello, On Jun 27 10:48 Paolo Bonzini wrote (excerpt): Il 27/06/2013 09:33, Aharon Robbins ha scritto: Fortunately, gawk and grep are already there, and I think the sed in the git repo is as well. Once Bash turns this on as default, the world will definitely be a better place, independent of

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Paolo Bonzini
Il 27/06/2013 16:31, Aharon Robbins ha scritto: I would suggest distros to rip out the #else part of this #ifndef. >>> >>> And I wouldn't, but as I have no control over the distros, I'm >>> not going to worry about it. >>> >>> All I know is that with the non-gawk case, gawk fails its test suit

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Aharon Robbins
> >> I would suggest distros to rip out the #else part of this #ifndef. > > > > And I wouldn't, but as I have no control over the distros, I'm > > not going to worry about it. > > > > All I know is that with the non-gawk case, gawk fails its test suite, > > and I've given up discussing it. > > Yo

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Paolo Bonzini
Il 27/06/2013 15:46, Aharon Robbins ha scritto: >> Date: Thu, 27 Jun 2013 15:27:40 +0200 >> From: Paolo Bonzini >> Subject: Re: locale specific ordering in EN_US -- why is a> >> Right now only gawk is different from the others, and not in a very >> clean manner: >> >> #ifndef GAWK >>

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Aharon Robbins
> Date: Thu, 27 Jun 2013 15:27:40 +0200 > From: Paolo Bonzini > Subject: Re: locale specific ordering in EN_US -- why is a > Right now only gawk is different from the others, and not in a very > clean manner: > > #ifndef GAWK > /* Defer to the system regex library about the meaning >

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Paolo Bonzini
Il 27/06/2013 14:11, Johannes Meixner ha scritto: > > Hello, > > On Jun 27 10:48 Paolo Bonzini wrote (excerpt): >> Il 27/06/2013 09:33, Aharon Robbins ha scritto: >>> >>> Fortunately, gawk and grep are already there, and I think the sed in >>> the git repo is as well. Once Bash turns this on as

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Paolo Bonzini
Il 27/06/2013 09:33, Aharon Robbins ha scritto: > Hi Paolo. > >> > I still believe that there is no place other than the glibc locale >> > descriptions where this can be fixed. > This is necessary but not sufficient. All of gawk, grep, sed and bash > run on lots of non-GLIBC systems. On non-glibc

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Aharon Robbins
Hi Paolo. > I still believe that there is no place other than the glibc locale > descriptions where this can be fixed. This is necessary but not sufficient. All of gawk, grep, sed and bash run on lots of non-GLIBC systems. The locale definitions, even for the same locale, vary wildly out in the

[gnu.org #838460] bash manual correction

2013-06-27 Thread Adrián via RT
Hello, > [ariy...@gmail.com - Tue Jun 25 16:04:34 2013]: > > Hello. > I have found a little difference between bash behaviour and it's > description in manual. > > According to the info from the url > http://www.gnu.org/software/bash/manual/html_node/Word-Designators.html > the symbol "$" design

Re: locale specific ordering in EN_US -- why is a

2013-06-27 Thread Paolo Bonzini
Il 26/06/2013 16:15, Pádraig Brady ha scritto: >>> > > This is why I started the Campaign For Rational Range Interpretation, >>> > > now part of gawk and I believe in the most recent grep also, which >>> > > returns us to the sane days of yesteryear, where [a-z] got only >>> > > lowercase >>> > >