Interesting. Most of the text fields are single-word fields or close to it, but some of the documents contain long text.
How long does a text need to be before hitting length normalization?


On Tue, Apr 8, 2014 at 11:36 AM, Ahmet Arslan <iori...@yahoo.com> wrote:

> Hi Nielsen,
>
> There is no special attention paid to first word. You are probably hitting
> length normalisation.
> Lucene/Solr punishes long documents, favours short documents.
> (5 times appearing one) longer?
>
>
>
> On Tuesday, April 8, 2014 12:03 PM, John Nielsen <j...@mcb.dk> wrote:
> Hi,
>
> We are seeing a strange phenomenon with our Solr setup which I have been
> unable to answer.
>
> My Google-fu is clearly not up to the task, so I am trying here.
>
> It appears that if i do a freetext search for a single word, say
> "modellering"
> on a text field, the scoring is massively boosted if the first word of the
> text field is a hit.
>
> For instance if there is only one occurrence of the word "modellering" in
> the text field and that occurrence is the first word of the text, then that
> document gets a higher relevancy than if the word "modelling" occurs 5
> times in the text and the first word of the text is any other word.
>
> Is this normal behavior? Is special attention paid to the first word in a
> text field? I would think that the latter case would get the highest score.
>
>
> --
> Med venlig hilsen / Best regards
>
> *John Nielsen*
> Programmer
>
>
>
> *MCB A/S*
> Enghaven 15
> DK-7500 Holstebro
>
> Kundeservice: +45 9610 2824
> p...@mcb.dk
> www.mcb.dk
>
>


--
Med venlig hilsen / Best regards

*John Nielsen*
Programmer



*MCB A/S*
Enghaven 15
DK-7500 Holstebro

Kundeservice: +45 9610 2824
p...@mcb.dk
www.mcb.dk
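
For reference, a rough sketch of why a single-word field can outrank a long field with five hits. It assumes the classic TF-IDF similarity (DefaultSimilarity in Lucene 4.x), where the term-frequency factor is sqrt(freq) and the length norm is 1/sqrt(number of terms in the field). The function names and the 100-term field length are made up for illustration; idf and queryNorm are left out because they are identical for both documents in a single-term query, and index-time boosts and the lossy one-byte norm encoding are ignored.

```python
import math


def tf(freq):
    # Term-frequency factor of the classic TF-IDF similarity: sqrt(freq).
    return math.sqrt(freq)


def length_norm(num_terms):
    # Length normalization: 1 / sqrt(number of terms in the field).
    # Assumption: no index-time boost, and no rounding from the
    # one-byte norm encoding that Lucene applies in practice.
    return 1.0 / math.sqrt(num_terms)


def field_weight(freq, num_terms):
    # The part of the score that differs between two documents
    # matching the same single-term query.
    return tf(freq) * length_norm(num_terms)


# Hypothetical documents: a one-word field with one hit, and a
# 100-word field with five hits.
short_doc = field_weight(freq=1, num_terms=1)
long_doc = field_weight(freq=5, num_terms=100)

print("short field:", short_doc)
print("long field: ", long_doc)
```

Under these assumptions there is no length threshold: the norm applies to every field and simply shrinks as the field grows, so the one-word field scores higher here even though the long field has five times as many hits. Running the query with debugQuery=true should show the actual fieldNorm values Solr used for each document.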