Yep it is / was, solved now.
Thanks Grant and Yonik.

Published the setup here: http://dev.tailsweep.com/solr-external-scoring/en/

//Marcus

On Fri, Apr 24, 2009 at 6:34 PM, Grant Ingersoll <gsing...@apache.org>wrote:

> See also my comment on your PageRank question, which I take is the same
> issue.  Namely, you might have a look into the external file stuff.  I've
> used that to store oft-changing boost information that then factors into the
> score.
>
> -Grant
>
>
> On Apr 24, 2009, at 12:03 PM, Michael McCandless wrote:
>
>  I think something like this (NOTE: not at all tested, and I have no
>> real experience with function queries):
>>
>>  ValueSource vals = new MyPageRankScores(...);
>>  ValueSourceQuery prQuery = new ValueSourceQuery(vals);
>>  Query realQuery = get-user's-query;
>>  Query q = new CustomScoreQuery(realQuery, prQuery);
>>  TopDocs hits = searcher.search(q, 10);
>>
>> MyPageRankScores is your class, subclassing ValueSource and implementing
>> the
>> getValues method.
>>
>> You could subclass CustomScoreQuery if you want to tweak just how the
>> "real" Query scores and your page-rank scores are combined.
>>
>> Mike
>>
>> On Fri, Apr 24, 2009 at 5:20 AM, Marcus Herou
>> <marcus.he...@tailsweep.com> wrote:
>>
>>> Yes I am thinking of something like that.
>>>
>>> Could you elaborate on how that would look like pseudo wise ?
>>>
>>> Kindly
>>>
>>> //Marcus
>>>
>>> On Fri, Apr 24, 2009 at 9:05 AM, Michael McCandless <
>>> luc...@mikemccandless.com> wrote:
>>>
>>>  Could function queries be used here?  EG you could implement a
>>>> ValueSource that pulls in the external scores?
>>>>
>>>> Mike
>>>>
>>>> On Thu, Apr 23, 2009 at 4:01 PM, Marcus Herou
>>>> <marcus.he...@tailsweep.com> wrote:
>>>>
>>>>> Hi.
>>>>>
>>>>> Confusing subject eh ? Trying to become a little clearer in a few
>>>>>
>>>> sentences.
>>>>
>>>>>
>>>>> We have a Solr/Lucene index where each document is a Blog Entry. We
>>>>> have
>>>>> just implemented the PageRank algorithm for Blogs and are about to add
>>>>> a
>>>>> column to the index called score and perhaps adjust the document boost.
>>>>>
>>>>> We have as well decided that it is the blog itself and not the
>>>>> individual
>>>>> pages that are to be ranked so all entries belonging to one blog will
>>>>> receive the same score.
>>>>>
>>>>> I have not found a way to apply a document score without actually
>>>>> re-indexing all fields in the affected entries (could very well be 100%
>>>>>
>>>> at
>>>>
>>>>> every PageRank recalculation) and this will of course take hell of a
>>>>> long
>>>>> time to reindex which effectively will render the process useless since
>>>>>
>>>> it
>>>>
>>>>> would take a week or of reindexing as of current and will take more and
>>>>>
>>>> more
>>>>
>>>>> time. (100M blog entries as of current and rapidly increasing).
>>>>>
>>>>> Guess we have run into the issue where we have some "static" data which
>>>>>
>>>> we
>>>>
>>>>> do not want to touch at all but we want to update certain "dynamic"
>>>>>
>>>> fields.
>>>>
>>>>>
>>>>> Lucene is not a database I know but is there a way to implement
>>>>> external
>>>>> search-time scoring or update individual fields ? Would there be a
>>>>> possibilty to do some kind of join (parallell searches separate index
>>>>>
>>>> types)
>>>>
>>>>> ? or send the result to a separate sorting algorithm ? Hmmm.... Perhaps
>>>>> a
>>>>> subclass of Sort ? Grasping at straws here folks...
>>>>>
>>>>> Hope anyone of the core experts can help us.
>>>>>
>>>>> Cheers
>>>>>
>>>>> //Marcus Herou
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Marcus Herou CTO and co-founder Tailsweep AB
>>>>> +46702561312
>>>>> marcus.he...@tailsweep.com
>>>>> http://www.tailsweep.com/
>>>>> http://blogg.tailsweep.com/
>>>>>
>>>>>
>>>>
>>>
>>>
>>> --
>>> Marcus Herou CTO and co-founder Tailsweep AB
>>> +46702561312
>>> marcus.he...@tailsweep.com
>>> http://www.tailsweep.com/
>>> http://blogg.tailsweep.com/
>>>
>>>
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
> Solr/Lucene:
> http://www.lucidimagination.com/search
>
>


-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
marcus.he...@tailsweep.com
http://www.tailsweep.com/
http://blogg.tailsweep.com/

Reply via email to