We have a limit in place to restrict searches to the first ten thousand pages. 
I am going to try to get that number reduced!  I'm thinking even as low as page 
fifty should be the limit. What human (with a wallet) would even go as deep as 
fifty pages?  :)

Sent from my iGizmo


On Dec 8, 2012, at 10:21 AM, "Otis Gospodnetic" <otis.gospodne...@gmail.com> 
wrote:

> It is common practise not to allow drilling deep in search results.
> 
> Otis
> --
> SOLR Performance Monitoring - http://sematext.com/spm
> On Dec 8, 2012 10:27 AM, "Jack Krupansky" <j...@basetechnology.com> wrote:
> 
>> What exactly is the common practice - is there a free, downloadable search
>> component that does that or at least a "blueprint" for "recommended best
>> practice"? What limit is common? (I know Google limits you to the top 1,000
>> results.)
>> 
>> -- Jack Krupansky
>> 
>> -----Original Message----- From: Otis Gospodnetic
>> Sent: Saturday, December 08, 2012 7:25 AM
>> To: solr-user@lucene.apache.org
>> Subject: Re: star searches with high page number requests taking long times
>> 
>> Hi Robert,
>> 
>> You should just prevent deep paging. Humans with wallets don't do that, so
>> you will not lose anything by doing that. It's common practice.
>> 
>> Otis
>> --
>> SOLR Performance Monitoring - http://sematext.com/spm
>> On Dec 7, 2012 8:10 PM, "Petersen, Robert" <rober...@buy.com> wrote:
>> 
>> Hi guys,
>>> 
>>> 
>>> Sometimes we get a bot crawling our search function on our retail web
>>> site.  The ebay crawler loves to do this (Request.UserAgent: Terapeakbot).
>>> They just do a star search and then iterate through page after page. I've
>>> noticed that when they get to higher page numbers like page 9000, the
>>> searches are taking more than 20 seconds.  Is this expected behavior?
>>> We're requesting standard facets with the search as well as incorporating
>>> boosting by function query.  Our index is almost 15 million docs now and
>>> we're on Solr 3.6.1, this isn't causing any errors to occur at the solr
>>> layer but our web layer times out the search after 20 seconds and logs the
>>> exception.
>>> 
>>> 
>>> 
>>> Thanks
>>> 
>>> Robi
>> 

Reply via email to