Bug#223068: sorting in unicode broken

2005-12-07 Thread Bob Proulx
Tomasz Wegrzanowski wrote: > $ cat X > a a > a c > ab c > $ cat X | LC_COLLATE=C sort Ordered based upon the underlying ASCII data encoding of the characters. > a a > a c > ab c > $ cat X | LC_COLLATE=pl_PL.UTF-8 sort > a a > ab c > a c Ordered based upon what the pl_PL locale ordering of the ch

Bug#223068: sorting in unicode broken

2005-12-07 Thread Michael Stone
You know what, I don't even care enough to discuss it anymore. The bug is wontfix and will stay that way. -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Bug#223068: sorting in unicode broken

2005-12-07 Thread Tomasz Wegrzanowski
2005/12/7, Michael Stone <[EMAIL PROTECTED]>: > On Wed, Dec 07, 2005 at 07:54:59PM +0100, Tomasz Wegrzanowski wrote: > >http://mathworld.wolfram.com/LexicographicOrder.html > > That's a nice reference and all, but irrelevant. Or are you claiming that > sort somehow generates non-sorted data? $ cat

Bug#223068: sorting in unicode broken

2005-12-07 Thread Michael Stone
On Wed, Dec 07, 2005 at 07:54:59PM +0100, Tomasz Wegrzanowski wrote: http://mathworld.wolfram.com/LexicographicOrder.html That's a nice reference and all, but irrelevant. Or are you claiming that sort somehow generates non-sorted data? The issue is that some symbols simply have no meaning for c

Bug#223068: sorting in unicode broken

2005-12-07 Thread Tomasz Wegrzanowski
2005/12/7, Michael Stone <[EMAIL PROTECTED]>: > On Wed, Dec 07, 2005 at 04:32:06AM +0100, Tomasz Wegrzanowski wrote: > >The problem is that sorting has two purposes: > >* sorting by scripts > >* sorting so the things are in the right order for humans > > > >The first: > >* simply has to be lexicogr

Bug#223068: sorting in unicode broken

2005-12-07 Thread Michael Stone
On Wed, Dec 07, 2005 at 04:32:06AM +0100, Tomasz Wegrzanowski wrote: The problem is that sorting has two purposes: * sorting by scripts * sorting so the things are in the right order for humans The first: * simply has to be lexicographical, otherwise too many things break I think that this wor

Bug#223068: sorting in unicode broken

2005-12-06 Thread Tomasz Wegrzanowski
The problem is that sorting has two purposes: * sorting by scripts * sorting so the things are in the right order for humans The first: * simply has to be lexicographical, otherwise too many things break * is much more common use of sort utility The second: * is less common * may use weird the lo

Bug#223068: sorting in unicode broken

2005-12-03 Thread Peter Eisentraut
There is no bug here. There is an ISO standard and associated national standards that determine how strings should be sorted. glibc is just following that standard. And of course coreutils has nothing to do with that; it just defers to the user's locale setting. I suggest that this bug can