The SolrRelevancyFAQ does suggest that both index-time and search-time boosting 
can be used to boost the score of newer documents, but doesn't suggest what 
reasons/contexts one might choose one vs the other.  It only provides an 
example of search-time boost though, so it doesn't answer the question of how 
to do an index time boost, if that was a question. 

http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_boost_the_score_of_newer_documents

Sorry, this doesn't answer your question, but does contribute the fact that 
some author of the FAQ at some point considered index-time boost not 
neccesarily unreasonable. 
________________________________________
From: Asif Rahman [a...@newscred.com]
Sent: Friday, June 04, 2010 11:31 PM
To: solr-user@lucene.apache.org
Subject: Re: Index-time vs. search-time boosting performance

It seems like it would be far more efficient to calculate the boost factor
once and store it rather than calculating it for each request in real-time.
Some of our queries match tens of thousands if not hundreds of thousands of
documents in a 15GB index.  However, I'm not well-versed in lucene internals
so I may be misunderstanding what is going on here.


On Fri, Jun 4, 2010 at 8:31 PM, Jay Hill <jayallenh...@gmail.com> wrote:

> I've done a lot of recency boosting to documents, and I'm wondering why you
> would want to do that at index time. If you are continuously indexing new
> documents, what was "recent" when it was indexed becomes, over time "less
> recent". Are you unsatisfied with your current performance with the boost
> function? Query-time recency boosting is a fairly common thing to do, and,
> if done correctly, shouldn't be a performance concern.
>
> -Jay
> http://lucidimagination.com
>
>
> On Fri, Jun 4, 2010 at 4:50 PM, Asif Rahman <a...@newscred.com> wrote:
>
> > Perhaps I should have been more specific in my initial post.  I'm doing
> > date-based boosting on the documents in my index, so as to assign a
> higher
> > score to more recent documents.  Currently I'm using a boost function to
> > achieve this.  I'm wondering if there would be a performance improvement
> if
> > instead of using the boost function at search time, I indexed the
> documents
> > with a date-based boost.
> >
> > On Fri, Jun 4, 2010 at 7:30 PM, Erick Erickson <erickerick...@gmail.com
> > >wrote:
> >
> > > Index time boosting is different than search time boosting, so
> > > asking about performance is irrelevant.
> > >
> > > Paraphrasing Hossman from years ago on the Lucene list (from
> > > memory).
> > >
> > > ...index time boosting is a way of saying this documents'
> > > title is more important than other documents' titles. Search
> > > time boosting is a way of saying "I care about documents
> > > whose titles contain this term more than other documents
> > > whose titles may match other parts of this query"....
> > >
> > > HTH
> > > Erick
> > >
> > > On Fri, Jun 4, 2010 at 5:10 PM, Asif Rahman <a...@newscred.com> wrote:
> > >
> > > > Hi,
> > > >
> > > > What are the performance ramifications for using a function-based
> boost
> > > at
> > > > search time (through bf in dismax parser) versus an index-time boost?
> > > > Currently I'm using boost functions on a 15GB index of ~14mm
> documents.
> > > >  Our
> > > > queries generally match many thousands of documents.  I'm wondering
> if
> > I
> > > > would see a performance improvement by switching over to index-time
> > > > boosting.
> > > >
> > > > Thanks,
> > > >
> > > > Asif
> > > >
> > > > --
> > > > Asif Rahman
> > > > Lead Engineer - NewsCred
> > > > a...@newscred.com
> > > > http://platform.newscred.com
> > > >
> > >
> >
> >
> >
> > --
> > Asif Rahman
> > Lead Engineer - NewsCred
> > a...@newscred.com
> > http://platform.newscred.com
> >
>



--
Asif Rahman
Lead Engineer - NewsCred
a...@newscred.com
http://platform.newscred.com

Reply via email to