Hi Robert,

That's some old thread from 1969 - that's before my time! :)

I'm not sure what 2+2lemma.txt is... aha, I see it on 
http://wordlist.sourceforge.net/12dicts-readme-r5.html -- a headword + N 
related 
words.  I don't think this will help me tame the overly aggressive Porter 
stemmer, although your sample "stemmer corrections for textTight, the 
plural-only stemmer (via StemmerOverrideFilter)" looks good and like something 
that *would* help me tame Porter.

errata    erratum
news    news
radii      radius
cavalrymen      cavalryman
...

Is the full dictionary you've built available anywhere for download?

Thanks,
Otis
P.S.
I saw that thread at http://search-lucene.com/m/jeWPi1X3FVw started a debate 
over what to include by default, concerns over performance, etc. -- I'd say 
it's 
better to include things like the above and comment it out (if we are afraid of 
poor performance out of the box or some such) than not providing it at all.
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: Robert Muir <rcm...@gmail.com>
> To: solr-user@lucene.apache.org
> Sent: Mon, April 25, 2011 2:20:45 PM
> Subject: Re: Good protwords.txt ?
> 
> On Mon, Apr 25, 2011 at 2:05 PM, Otis Gospodnetic
> <otis_gospodne...@yahoo.com>  wrote:
> > Hi,
> >
> > Are there any good / comprehensive examples  of protwords.txt for English?
> > Or good stemdict.txt examples that work  with StemmerOverrideFilterFactory?
> >
> > Would be good to have a good  example to include in Solr distribution...
> >
> 
> I brought this up a  while ago (as I am probably more than 50-60% done
> with all of this via  2+2lemma.txt) and there was no interest:
> 
>http://www.lucidimagination.com/search/document/180c90276e589d68/solr_example_synonyms_file
>e
> 

Reply via email to