Re: More Like This boost

Francisco Sanmartin Tue, 22 Apr 2008 10:25:11 -0700

Yep, it would be nice for MLT to have this feature, that's why I amtrying to do it from the querys before sending the query to Solr. Theseare the steps I'm following:

1. execute a mlt.like() with the text document_example.getTitle()against the field "Title" of all the other documents. This returns aquery containing the most relevant words in the example_document and inthe rest of documents in the Title. We will call this query"QueryTitle". For example QueryTitle = (words^0.4 in^0.3 the^0.56title^0.65)2. execute a mlt.like() with the text document_example.getDescription()against the field "Description" of all the other documents. This returnsa query containin the most relevant words in the example_document and inthe rest of documents in the Description. We will call this query"QueryDescription". For example QueryDescription = (other^0.66 words^0.7in^0.33 the^0.49 description^0.43)


Up to here, everything is possible with the options that offers MLT.

Now, with the info MLT gave me (QueryTitle and QueryDescription), i wantto look in Solr for the documents (and more filters) to retrieve thebest matches. But I want QueryTitle to be more important thatQueryDescription, for example 70% and 30% respectively. This means thatwe should do QueryTitle^0.70 and QueryDescription^0.30. This meanshaving a query for Solr like this:(words^0.4 in^0.3 the^0.56 title^0.65)^0.70 (other^0.66 words^0.7in^0.33 the^0.49 description^0.43)^0.30

The question is...is Solr able to "understand" a query boosted who hasits terms boosted already? (Remember that MLT returns the "interestingterms" boosted). This does make sense? Will the words obtained from amlt.like() on the title be 70% relevant while the words obtained from amlt.like() on the description will be only 30% relevant?

Of course it would be a nice feature to be able to boost these thingsnatively and do only one call to MLT...Don't hesitate to contact me ifyou need any help on developing this feature.


Thanks!

Pako

Erik Hatcher wrote:

No, the MLT feature does not have that kind of field-specific boostingcapability. It sounds like it could be a useful enhancement though.Of course you do get boosts for "interesting terms" already, but maybehaving an additional field-specific boost would be a nice touch too.
    Erik

On Apr 22, 2008, at 9:13 AM, Francisco Sanmartin wrote:
I know that only one query of that type does not change anything. Butwhen it's two or more with different boosts, i hope it does. Here isthe situation:My docs have "Title" and "Description". What I want to do is to givemore relevancy to the morelikethis on the title than on thedescription. So the query would be like this:
query = (words^0.4 in^0.3 the^0.56 title^0.65)^0.70 (words^0.7in^0.33 the^0.49 description^0.43)^0.30
This way, the words in the title are more relevant than the words inthe description, right?
Thanks!

Pako


Erik Hatcher wrote:
On Apr 21, 2008, at 5:02 PM, Francisco Sanmartin wrote:
Is it possible to boost the query that MoreLikeThis returns beforesending it to Solr? I mean, technically is possible, because youcan add a factor to the whole query but...does it make sense?(Remember that MoreLikeThis can already boosts each term inside thequery).
For example, this could be a result of MoreLikeThis (with nativeboosting enabled)
queryResultMLT = (this^0.4 is^0.5 a^0.6 query^0.33 of^0.29morelikethis^0.67)
what I want to do is
queryResulltMLT = (this^0.4 is^0.5 a^0.6 query^0.33 of^0.29morelikethis^0.67)^0.60 <---(notice the boost of 0.60 for thewhole query)
That last boost wouldn't change the doc ordering at all, so it'd bekinda useless.
What are you trying to accomplish?

    Erik

Re: More Like This boost

Reply via email to