Roger Marquis wrote:
> >because the LC_COLLATE rules for the C locale don't ignore
> >whitespace
> 
> I could find nothing in the POSIX Locale spec that requires every
> flag of a command to ignore whitespace. Such a requirement would
> be ridiculous on its face.

POSIX requires that sort respect the values of LANG, LC_COLLATE and
LC_ALL to set the locale for ordering rules.

  http://www.opengroup.org/onlinepubs/009695399/utilities/sort.html

> >Not broken, more working as designed (as is coreutils). Your
> >expectations may not match the behavior, but it is consistent
> >with the design (and documentation) of the sort command.
> 
> Your opinion, probably motivated by the desire avoid fixing a bug.
> Sort's -M flag was designed to sort the syslog date field, and
> ignore whitespace, and there is no Unicode requirement that
> conflicts with that design.

You don't like it.  I don't like it.  I don't know of anyone who likes
it.  But just the same the powers-that-be have created the locale
collating sequences in such a way that whitespace and punctuation is
ignored and case is folded.  Personally I think they confused working
with data on a computer with writing text about working on a computer.

You probably need to set LC_ALL=C in your environment.

> >>This is not an issue, however, because Solaris does not set the
> >>default LANG to UTF-8
> >Neither does debian, that's an install-time choice. (Solaris has
> >the same kind of install-time locale selection.)
> 
> I can't recall changing this default during installation. What
> should /etc/environment have set LANG to?

Try this:

  sudo dpkg-reconfigure locales

In answer to the debconf question "Which locale should be the default
in the system environment?" select "None".

Bob


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to