On 01/21/2009 12:25 PM, Matthew Runo wrote: > At a certain level it will become better to have multiple smaller boxes > rather than one huge one. I've found that even an old P4 with 2 gigs of > ram has decent response time on our 150,000 item index with only a few > users - but it quickly goes downhill if we get more than 5 or 6. How > many documents are you going to be storing in your index? How much of > them will be "stored" versus "indexed"? Will you be faceting on the > results?
Thanks for the tip on multiple boxes. We'll be hosting about 20 databases total. A couple of them are in the 10- to 20-million record range and a couple more are in the 5- to 10-million range. It's highly structured data and I anticipate a lot of faceting and indexing almost all the fields. > > In general, I'd recommend a 64 bit processor with enough ram to store > your index in ram - but that might not be possible with "millions" of > records. Our 150,000 item index is about a gig and a half when optimized > but yours will likely be different depending on how much you store. > Faceting takes more memory than pure searching as well. > This is very helpful. Thanks again. -- Thomas Dowling