Hi Robert, That's some old thread from 1969 - that's before my time! :)
I'm not sure what 2+2lemma.txt is... aha, I see it on http://wordlist.sourceforge.net/12dicts-readme-r5.html -- a headword + N related words. I don't think this will help me tame the overly aggressive Porter stemmer, although your sample "stemmer corrections for textTight, the plural-only stemmer (via StemmerOverrideFilter)" looks good and like something that *would* help me tame Porter. errata erratum news news radii radius cavalrymen cavalryman ... Is the full dictionary you've built available anywhere for download? Thanks, Otis P.S. I saw that thread at http://search-lucene.com/m/jeWPi1X3FVw started a debate over what to include by default, concerns over performance, etc. -- I'd say it's better to include things like the above and comment it out (if we are afraid of poor performance out of the box or some such) than not providing it at all. ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ----- Original Message ---- > From: Robert Muir <rcm...@gmail.com> > To: solr-user@lucene.apache.org > Sent: Mon, April 25, 2011 2:20:45 PM > Subject: Re: Good protwords.txt ? > > On Mon, Apr 25, 2011 at 2:05 PM, Otis Gospodnetic > <otis_gospodne...@yahoo.com> wrote: > > Hi, > > > > Are there any good / comprehensive examples of protwords.txt for English? > > Or good stemdict.txt examples that work with StemmerOverrideFilterFactory? > > > > Would be good to have a good example to include in Solr distribution... > > > > I brought this up a while ago (as I am probably more than 50-60% done > with all of this via 2+2lemma.txt) and there was no interest: > >http://www.lucidimagination.com/search/document/180c90276e589d68/solr_example_synonyms_file >e >