On Tue, Sep 19, 2006 at 08:37:33PM +0300, Martin-Éric Racine wrote:
>
> On Tue, September 19, 2006 13:27, Agustin Martin wrote:
> > You can use 'sed 1d' to strip first line and make this less verbose. This is
> > what I do for the esperanto dictionary (with eo changed to ru):
> >
> > cat ru.dic | sed 1d | LC_COLLATE=C sort -u | prezip > ru.cwl
>
> Except that, unless I'm mistaken, using the C collation rules for
> languages other than English languages is wrong.

I admit that at first I did not remember at all where that collation rule
came from, but after some googling I finally remembered why I put it there.
The 'proc' script shipped with aspell-lang (available from the aspell site)
is what aspell upstream considers the proper way to prepare official sources
for a new aspell dict. In the resulting makefile (Makefile.pre) it puts

.wl.cwl:
	cat $< | LC_COLLATE=C sort -u | ${PREZIP} -z > $@

for all languages. It also has some warnings about using sort -u and the C
collation rule,

-- 
Agustin
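
For illustration, here is a minimal sketch of what the sed/sort part of that pipeline does. The sample words and the file name sample.dic are made up, the first line is assumed to be an entry count as in ru.dic, and the final prezip step is left out so the sketch runs without aspell installed. LC_ALL is unset first because, when it is set, it takes precedence over LC_COLLATE.

```shell
#!/bin/sh
# Sketch only: sample data and file names are hypothetical.
set -eu

# LC_ALL, when set, overrides LC_COLLATE, so clear it for this run.
unset LC_ALL

workdir=$(mktemp -d)
trap 'rm -rf "$workdir"' EXIT

# Tiny stand-in for ru.dic: a count line followed by words,
# including a duplicate and mixed-case entries.
printf '%s\n' 5 zebra Apple apple zebra Zebra > "$workdir/sample.dic"

# Drop the first line, then sort by byte value and remove duplicates.
# In the real pipeline the output would be piped on to prezip.
result=$(sed 1d "$workdir/sample.dic" | LC_COLLATE=C sort -u)

printf '%s\n' "$result"
```

With the C collation the order is plain byte order, so uppercase entries come before lowercase ones and the result does not depend on the locale of the machine building the dictionary.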