Re: improving score of result set

Alexander Aristov Mon, 29 Oct 2012 12:08:37 -0700

Perhapse this is a XY problem.

First of all I don't have a site which I want to boost. All docs are equal.


Secondly I will explain what I have. I have 100 docs indexed. I do a query
which returns 10 found docs. 8 of them from one site and 2 from other
different sites. I dont like order. Technically scores are good. I
understand why these 8 docs go first - because they havebetter matching.
But i dont like it. I want that articles from smaller collections would
somehow compete with other docs. For other queries situation can change and
another site can produce more results. In that case i would  lower that
site.

I've had a deep thought and think can try grouping.

More insites on my problem. These 8 docs have similar text which matches
query and  thats why they all get similar and relatively high score. For
example docs have text:

1. Red apple felt from tree
2 blue apple felt from tree
3 green apple felt from tree
...
8 orange pineapple felt from tree
9 a boy felt suddenly ill. A tree was green.
10 two pices felt apart and newer collapse. Family tree was reach.

I query "felt tree". Docs 1-8 from one site.

I would like to make the score of docs 9 and 10 higher.

Grouping can help but maybe there are othe solutions.

Alexander
 29.10.2012 22:11 пользователь "Chris Hostetter" <hossman_luc...@fucit.org>
написал:

>
> You've mentioned that you want ot "improve" the scores of these documents,
> but you haven't really given any specifics about when/how/why you wnat to
> improve the score in general -- ie: in this examples you have a total of
> 10 docs, but how do you distinguish the 2 special docs from the 8 other
> docs?  is it because they are the only two docs with some specific
> field value, or is it just because they are in the smaller of two "sets"
> of documents if you partition on some field?  if you added 100 more docs
> that were all in the same set as those two, would you want the other 8
> documents to start getting boosted?
>
> Let's assume that what you are trying to ask is..
>
>   "I want to artificially boost the scores of documents when the 'site'
>    field contains 'cnn.com'"
>
> A simple way to do that is just to add an optional clause to your query
> that matches on "site:cnn.com" so the scores of those documents will be
> increased, but make the "main" part of your query required...
>
>    q=+(your main query) site:cnn.com
>
> Or if you use the dismax or edismax parsers there are special params (bq
> and/or boost) that help make this easy to split out...
>
>
> https://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_increase_the_score_for_specific_documents
>
>
>
> FWIW: this smells like an XY problem ... more details baout your actaul
> situation and end goal would be helpful...
>
> https://people.apache.org/~hossman/#xyproblem
> XY Problem
>
> Your question appears to be an "XY Problem" ... that is: you are dealing
> with "X", you are assuming "Y" will help you, and you are asking about "Y"
> without giving more details about the "X" so that we can understand the
> full issue.  Perhaps the best solution doesn't involve "Y" at all?
> See Also: http://www.perlmonks.org/index.pl?node_id=542341
>
>
>
> -Hoss
>

Re: improving score of result set

Reply via email to