Hello,
I had some differences in solr score between solr 3.1 and solr 4.
I have a searchfield with the following type:
<fieldType name="text_delimiter" class="solr.TextField">
<analyzer type="index">
<charFilter class="solr.HTMLStripCharFilterFactory"/>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.ASCIIFoldingFilterFactory"/>
<filter class="solr.TrimFilterFactory"/>
<filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" catenateWords="1" splitOnCaseChange="0"
splitOnNumerics="0" stemEnglishPossessive="0" />
</analyzer>
<analyzer type="query">
<charFilter class="solr.HTMLStripCharFilterFactory"/>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.ASCIIFoldingFilterFactory"/>
<filter class="solr.TrimFilterFactory"/>
<filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
catenateWords="0" splitOnCaseChange="0" splitOnNumerics="0"
stemEnglishPossessive="0" />
</analyzer>
</fieldType>
An example of fieldnorms:
SearchTerm = *barcelona*
solr 3.1:
fc *barcelona* soccer club -> 0.5
fc-*barcelona* soccer club -> 0.5
solr 4:
fc *barcelona* soccer club -> 0.5
fc-*barcelona* soccer club -> 0.4375
It could be the catenateWords of the fieldtype conf:
fc,barcelona,fcbarcelona,soccer,club(5 terms = 0.4375)
Strange that in solr 3.1 it was just counting for 4 terms with the same
filter.
Why is fieldnorm different? I need some help with this:)
Thanks
Roy
--
View this message in context:
http://lucene.472066.n3.nabble.com/Fieldnorm-solr-4-specialchars-worddelimiter-tp4036248.html
Sent from the Solr - User mailing list archive at Nabble.com.