On Fri, Dec 07, 2007 at 08:03:31PM +0100, Marc Coll wrote:
> Package: wcatalan
> Version: 0.5-3
> Severity: normal
>
> The wordlist file is surprisingly big compared to the same file in English
> or Spanish (7.5 MB compared to less than 1 MB). The cause seems to be that
> there are a lot of repeated words. A few examples are: abacallani,
> embalsameu, embali...
>
> I'm currently working on a little program which should be able to find and
> remove all duplicated occurrences. I'll send the corrected version of the
> file to the package maintainer as soon as I get it to work.

I do not have the sources here, but a combination of sort and uniq during
the build process should do the trick. I will try to look at this next week.

--
Agustin
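
For illustration only, here is a rough Python sketch of the sort-and-uniq idea mentioned above. It is not the package's actual build step. It assumes the wordlist is a plain text file with one word per line; the function names, the file paths and the UTF-8 default encoding are hypothetical and would need to match the real sources.

```python
def dedupe_wordlist(lines):
    """Return the sorted, unique, non-empty words from an iterable of lines.

    Roughly what `sort | uniq` does, except that surrounding whitespace
    is stripped and empty lines are dropped (an assumption of this sketch).
    """
    words = {line.strip() for line in lines}  # a set removes the duplicates
    words.discard("")                         # ignore blank lines
    return sorted(words)                      # plain code point order


def dedupe_file(src_path, dst_path, encoding="utf-8"):
    """Write the deduplicated wordlist to dst_path and return the word count.

    The paths and the default encoding are hypothetical; the real wordlist
    may use a different encoding.
    """
    with open(src_path, encoding=encoding) as src:
        words = dedupe_wordlist(src)
    with open(dst_path, "w", encoding=encoding) as dst:
        dst.writelines(word + "\n" for word in words)
    return len(words)
```

Something like dedupe_file("wordlist.in", "wordlist.out") could then run once at build time. Note that the ordering is by code point rather than by Catalan collation rules, so the result may be ordered differently from a locale-aware sort.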