Hi. Any good protwords.txt out there?
In a fairly standard solr analyzer chain, we use the English Porter analyzer like so: <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/> For most purposes the porter does just fine, but occasionally words come along that really don't work out to well, e.g., "maine" is stemmed to "main" - clearly goofing up precision about "Maine" without doing much good for variants of "main". So - I have an entry for my protwords.txt. What else should go in there? Thanks for your ideas, Dave Woodward