Re: [Rd] collation order

2006-03-17 Thread Peter Dalgaard
Peter Dalgaard <[EMAIL PROTECTED]> writes: > > Sys.setlocale(locale="C") > [1] "C" > > "Thomas O'Malley" < "Thomas Lumley" > [1] TRUE > > Sys.setlocale(locale="en_US.UTF8") > [1] > "LC_CTYPE=en_US.UTF8;LC_NUMERIC=C;LC_TIME=en_US.UTF8;LC_COLLATE=en_US.UTF8;LC_MONETARY=en_US.UTF8;LC_MESSAGES=C;LC_

Re: [Rd] collation order

2006-03-17 Thread Peter Dalgaard
Thomas Lumley <[EMAIL PROTECTED]> writes: > The following caused a hard-to-diagnose problem for a user of the survey > package. Presumably this is a strange Unicode thing, but is there a > convenient reference for how the collation order is determined? I am > surprised that adding the same cha

Re: [Rd] collation order

2006-03-17 Thread Simon Urbanek
On Mar 17, 2006, at 4:32 PM, Thomas Lumley wrote: > The following caused a hard-to-diagnose problem for a user of the > survey package. Presumably this is a strange Unicode thing, It is independent of the encoding: [EMAIL PROTECTED]:~$ LC_COLLATE=en_US R --vanilla -q "1//"<"10/" [1] TRUE > "

[Rd] collation order

2006-03-17 Thread Thomas Lumley
The following caused a hard-to-diagnose problem for a user of the survey package. Presumably this is a strange Unicode thing, but is there a convenient reference for how the collation order is determined? I am surprised that adding the same character to the end of two strings of the same leng