Interesting.

Most of the text fields are single word fields or close to it, but on some
of the documents, long text appears.

How long does a text need to be before hitting length normalization?


On Tue, Apr 8, 2014 at 11:36 AM, Ahmet Arslan <iori...@yahoo.com> wrote:

> Hi Nielsen,
>
> There is no special attention paid to first word. You are probably hitting
> length normalisation.
> Lucene/Solr punishes long documents, favours short documents.
> (5 times appearing one) longer?
>
>
>
> On Tuesday, April 8, 2014 12:03 PM, John Nielsen <j...@mcb.dk> wrote:
> Hi,
>
> We are seeing a strange phenomenon with our Solr setup which I have been
> unable to answer.
>
> My Google-fu is clearly not up to the task, so I am trying here.
>
> It appears that if i do a freetext search for a single word, say
> "modellering"
> on a text field, the scoring is massively boosted if the first word of the
> text field is a hit.
>
> For instance if there is only one occurrence of the word "modellering" in
> the text field and that occurrence is the first word of the text, then that
> document gets a higher relevancy than if the word "modelling" occurs 5
> times in the text and the first word of the text is any other word.
>
> Is this normal behavior? Is special attention paid to the first word in a
> text field? I would think that the latter case would get the highest score.
>
>
> --
> Med venlig hilsen / Best regards
>
> *John Nielsen*
> Programmer
>
>
>
> *MCB A/S*
> Enghaven 15
> DK-7500 Holstebro
>
> Kundeservice: +45 9610 2824
> p...@mcb.dk
> www.mcb.dk
>
>


-- 
Med venlig hilsen / Best regards

*John Nielsen*
Programmer



*MCB A/S*
Enghaven 15
DK-7500 Holstebro

Kundeservice: +45 9610 2824
p...@mcb.dk
www.mcb.dk

Reply via email to