Hi, I have not worked with a 50 node Solr cluster, but I've worked with pure Lucene clusters of that size, very high query and data volumes. I don't imagine a dist search involving 50 nodes will be a problem for Solr. As for handling query slave failures, I'm sure you'll want to involve a LB that can detect those, and have multiple replicas of each query node behind it for fail-over.
As for the manageability, I think you'll find that management is really mostly on you - Solr doesn't provide tools for cluster / shard management. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: souravm <[EMAIL PROTECTED]> > To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> > Sent: Sunday, December 7, 2008 12:40:34 AM > Subject: Limitations of Distributed Search .... > > Hi, > > We are planning to use Solr for processing large volume of application log > files > (around ~ 10 Billions documents of size 5-6 TB). > > One of the approach we are considering for the same is to use Distributed > Search > extensively. > > What we have in mind is distributing the log files in multiple boxes monthly > or > weekly basis - where at the weekly basis itself the volume can go to the > level > of 200 M of documents. And a search query can spread across all weeks (e.g. > number of a given txn for 1st 6 months of a year) > > However, what we are not sure how well the distributed search would scale > when > we may use around 50-60 boxes to distribute indexed documents on weekly > basis. > The specific questions I have in mind are - > > a) How would be the impact on the performance when a query spreads over 50 > boxes > b) Is there any hard limit on the number of slaves which can be contacted > from > the master server? > c) How much load will this type of approach create on master server for > merging > data, keeping the track whether a slave is down or not > d) Any other manageability issues with so many slaves > > If anyone of you have deployed Solr in such a environment it would be great > if > you can share your experience on the same. > > Thanks in advance. > > Regards, > Sourav > > > > **************** CAUTION - Disclaimer ***************** > This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely > for the use of the addressee(s). If you are not the intended recipient, > please > notify the sender by e-mail and delete the original message. Further, you are > not > to copy, disclose, or distribute this e-mail or its contents to any other > person > and > any such actions are unlawful. This e-mail may contain viruses. Infosys has > taken > every reasonable precaution to minimize this risk, but is not liable for any > damage > you may sustain as a result of any virus in this e-mail. You should carry out > your > own virus checks before opening the e-mail or attachment. Infosys reserves > the > right to monitor and review the content of all messages sent to or from this > e-mail > address. Messages sent to or from this e-mail address may be stored on the > Infosys e-mail system. > ***INFOSYS******** End of Disclaimer ********INFOSYS***