I surmise that you are using the "text_it" field type, or something similar. It has:

<filter class="solr.ItalianLightStemFilterFactory"/>

When I enter "trento" into the Solr admin analysis page, that last filter transforms "trento" into "trent", just as we see in the query explain.

So, indeed, this looks like a stemming anomaly.

I see this comment in the code: "To prevent terms from being stemmed use an instance of KeywordMarkerFilter", so you could use "solr.KeywordMarkerFilterFactory" and create a protected words list text file.
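
As a rough sketch of what that might look like in the field type's analyzer (I am assuming the file is named "protwords.txt" and sits in the conf directory; the other filters in your chain are omitted here), the marker filter would need to come before the stemmer:

```xml
<!-- Sketch only: the rest of the analyzer chain is omitted. -->
<!-- Marks terms listed in protwords.txt (assumed file name) as keywords,
     so the stemmer that follows leaves them unchanged. -->
<filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
<filter class="solr.ItalianLightStemFilterFactory"/>
```

The protected words file would hold one term per line, for example a line containing just trento (lowercase, assuming a lowercase filter runs earlier in the chain). If the field type is used at index time as well, you would need to reindex for the change to take effect.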

-- Jack Krupansky

-----Original Message-----
From: mattia.martine...@gmail.com
Sent: Monday, August 27, 2012 1:18 PM
To: solr-user@lucene.apache.org
Subject: Re: Understanding SOLR search results

Maybe you have a synonym in the title field? Or maybe some stemming anomaly?

The complete title is "Moschea in viale Trento, partono i lavori", so
"Trent" should be a substring of the word "Trento".
But if I search for "Mos" or "lavo", I don't have this result, so I
don't understand why "Trent" is taken as a subword, and "Mos" and
"lavo" are not.

Do you have any idea?
