Re: Removing duplication: Word lists of common words in languages

2015-01-16 Thread Ben Finney
Ben Finney writes: > Where is a good authoritative source of such words, by frequency, for > various natural languages, suitable for inclusion in Debian as a data > package? The package ‘scowl’ <https://packages.debian.org/sid/scowl> looks like a good candidate already in Debian. I will investiga

Re: Removing duplication: Word lists of common words in languages

2014-11-12 Thread Ian Jackson
Ben Finney writes ("Re: Removing duplication: Word lists of common words in languages"): > Ian Jackson writes: > > I had roughly this question in 2013, and found the answer. Here is > > probably the best starting point: > > > > http://www.chiark.greenend.o

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ben Finney
Ian Jackson writes: > I had roughly this question in 2013, and found the answer. Here is > probably the best starting point: > > http://www.chiark.greenend.org.uk/ucgi/~ijackson/git?p=evade-mail-usrlocal.git;a=blob;f=lemma.al-permission.mbox Great! That asks for permission to redistribute the c

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ian Jackson
Ben Finney writes ("Re: Removing duplication: Word lists of common words in languages"): > Where is a good authoritative source of such words, by frequency, for > various natural languages, suitable for inclusion in Debian as a data > package? I had roughly this question in

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ben Finney
Simon McVittie writes: > On 10/11/14 23:16, Ben Finney wrote: > > To avoid duplicating these “the N most common words, ranked by > > frequency, for language FOO” > > For a password generator you ideally want the word-list to be sorted > alphabetically, so that it's trivial to verify "by eye" that

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Simon McVittie
On 10/11/14 23:16, Ben Finney wrote: > To avoid duplicating these “the N most common words, ranked by > frequency, for language FOO” For a password generator you ideally want the word-list to be sorted alphabetically, so that it's trivial to verify "by eye" that there are no duplicates. Duplicate
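
A minimal sketch of the point about alphabetical sorting, not code from the thread: once a word list is sorted, any duplicates sit next to each other, so a single pass comparing neighbours finds them. The function name and the sample words are hypothetical, chosen only for illustration.

```python
def find_adjacent_duplicates(sorted_words):
    """Return each word that occurs more than once in a sorted word list.

    Assumes the input is already sorted, so equal words are adjacent.
    """
    duplicates = []
    for previous, current in zip(sorted_words, sorted_words[1:]):
        # Report each duplicated word once, even if it occurs three or more times.
        if previous == current and (not duplicates or duplicates[-1] != current):
            duplicates.append(current)
    return duplicates


# Hypothetical sample list; a real word list would be read from a file.
words = sorted(["correct", "horse", "battery", "staple", "horse"])
print(find_adjacent_duplicates(words))
```

An empty result means the sorted list contains no repeated entries; the same adjacency is what makes the check feasible "by eye" on a sorted file.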

Removing duplication: Word lists of common words in languages (was: Bug#768772: ITP: xkcdpass …)

2014-11-10 Thread Ben Finney
On 10-Nov-2014, Jonas Smedegaard wrote: > Crypt::XkcdPassword by default uses "the most commonly used words in > film scripts and television shows", and documents examples of > adaptations at . Thank you, it's good to know these exist. I do