Or even better, don’t remove stopwords. Stopwords are a technique invented for 16-bit machines, where common words made posting lists too long to to handle.
wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Apr 18, 2018, at 2:20 PM, Jay Potharaju <jspothar...@gmail.com> wrote: > > A is part of stopwords ...that is why it got dropped. Protected words will > only stop it from stemming > > https://lucene.apache.org/solr/guide/6_6/language-analysis.html > > Thanks > Jay Potharaju > > > On Wed, Apr 18, 2018 at 11:35 AM, Wendy2 <wendy....@rcsb.org> wrote: > >> Hi fellow Users, >> >> Why did Solr return "Ellington, W.R." when I did a name search for >> "Ellington, A."? >> I even added "A." in the protwords.txt file. The debugQuery shows that the >> middle initial got dropped in the parsedquery. >> How can I make Solr NOT to drop the middle initial? Thanks for your >> help!! >> >> ======Search results======== >> Ellington, A.D. >> Ellington, R.W.. >> >> =======debugQuery========= >> { >> "responseHeader":{ >> "status":0, >> "QTime":51, >> "params":{ >> "q":"\"Ellington, A.\"", >> "indent":"on", >> "fl":"audit_author.name", >> "wt":"json", >> "debugQuery":"true"}}, >> "response":{"numFound":2,"start":0,"docs":[ >> { >> "audit_author.name":"Azzi, A., Clark, S.A., Ellington, R.W., >> Chapman, M.S."}, >> { >> "audit_author.name":"Ye, X., Gorin, A., Ellington, A.D., Patel, >> D.J."}] >> }, >> "debug":{ >> "rawquerystring":"\"Ellington, A.\"", >> "querystring":"\"Ellington, A.\"", >> >> "parsedquery":"(+DisjunctionMaxQuery(((entity_name_com.name: >> ellington)^20.0)))/no_coord", >> "parsedquery_toString":"+((entity_name_com.name:ellington)^20.0)", >> "QParser":"ExtendedDismaxQParser", >> >> >> >> >> -- >> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html >>