On Tue, Sep 19, 2006 at 08:37:33PM +0300, Martin-Éric Racine wrote:
> 
> On Tue, September 19, 2006 13:27, Agustin Martin wrote:
> > You can use 'sed 1d' to strip the first line and make this less verbose.
> > This is what I do for the Esperanto dictionary (with eo changed to ru):
> >
> > cat ru.dic | sed 1d | LC_COLLATE=C sort -u | prezip > ru.cwl
> 
> Except that, unless I'm mistaken, using the C collation rules for
> languages other than English is wrong.

I admit that at first I did not remember at all where that collation rule
came from, but after some googling I finally remembered why I put it there.
The 'proc' script, shipped with aspell-lang (available from the aspell
site), is what aspell upstream considers the proper way to prepare official
sources for a new aspell dict. It puts, in the resulting makefile
(Makefile.pre),

.wl.cwl:
        cat $< | LC_COLLATE=C sort -u | ${PREZIP} -z > $@

for all languages. It also has some warnings about using sort -u and the
C collation rule.
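
As a minimal sketch of the strip-and-sort step discussed in this thread:
the helper name dic_to_wordlist and the sample input are made up for
illustration and are not part of aspell-lang, and the prezip stage is left
out so the snippet runs without aspell installed. It uses LC_ALL=C rather
than LC_COLLATE=C only because LC_ALL, when set in the environment,
overrides LC_COLLATE.

```shell
# Hypothetical helper, not part of aspell-lang: reads a .dic-style list on
# stdin, drops its first line, and prints the unique lines in byte order.
dic_to_wordlist() {
    # LC_ALL=C forces byte-wise comparison regardless of the caller's locale.
    sed 1d | LC_ALL=C sort -u
}

# Sample input (made up): a first line to be stripped, then four words,
# one of them duplicated.
printf '4\nb\na\nB\na\n' | dic_to_wordlist
```

With the C rules the comparison is done byte by byte, so uppercase ASCII
letters sort before lowercase ones and the sample prints B, a, b, one per
line. The result does not depend on which locales are installed on the
build machine.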

-- 
Agustin
