Re: Some advice on scalability

2008-05-18 Thread Otis Gospodnetic
Hi, Not that I can think of at the moment. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Norberto Meijome <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Sunday, May 18, 2008 10:31:19 AM > Subject:

Re: Some advice on scalability

2008-05-18 Thread Norberto Meijome
On Thu, 15 May 2008 09:23:03 -0700 "William Pierce" <[EMAIL PROTECTED]> wrote: [...] > > Our app in brief: We get merchant sku files (in either xml/csv) which we > process and index and make available to our site visitors to search. Our > current plan calls for us to support approx 10,000 me

Re: Some advice on scalability

2008-05-18 Thread Norberto Meijome
On Thu, 15 May 2008 12:54:25 -0700 (PDT) Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > 5) Hardware recommendations are hard to do. While people may make > suggestions, the only way to know how *your* hardware works with *your* data > and *your* shards and *your* type of queries is by benchmarki

Re: Some advice on scalability

2008-05-15 Thread William Pierce
Otis: I will take a look at the DistributedSearch page on solr wiki. Thanks, Bill -- From: "Otis Gospodnetic" <[EMAIL PROTECTED]> Sent: Thursday, May 15, 2008 12:54 PM To: Subject: Re: Some advice on scalability Bill, Quick f

Re: Some advice on scalability

2008-05-15 Thread Otis Gospodnetic
Bill, Quick feedback: 1) use 1.3-dev or 1.3 when it comes out, not 1.2 2) you did not mention Solr's distributed search functionality explicitly, so I get a feeling you are not aware of it. See DistributedSearch page on the Solr wiki 3) you definitely don't want a single 500M docs index that

Re: Some advice on scalability

2008-05-15 Thread Alexander Ramos Jardim
I am not a solr expert, but is it possible to try to build indexes based on search statiscs? What is that you could have a monitoring service that would generate statics of search queries, document returns and place weights to each queriy based on ocurrence, impact on the index, time to respond and