On 01/21/2009 12:25 PM, Matthew Runo wrote:
> At a certain level it will become better to have multiple smaller boxes
> rather than one huge one. I've found that even an old P4 with 2 gigs of
> ram has decent response time on our 150,000 item index with only a few
> users - but it quickly goes downhill if we get more than 5 or 6. How
> many documents are you going to be storing in your index? How much of
> them will be "stored" versus "indexed"? Will you be faceting on the
> results?

Thanks for the tip on multiple boxes.  We'll be hosting about 20
databases total.  A couple of them are in the 10- to 20-million record
range and a couple more are in the 5- to 10-million range.  It's highly
structured data and I anticipate a lot of faceting and indexing almost
all the fields.

> 
> In general, I'd recommend a 64 bit processor with enough ram to store
> your index in ram - but that might not be possible with "millions" of
> records. Our 150,000 item index is about a gig and a half when optimized
> but yours will likely be different depending on how much you store.
> Faceting takes more memory than pure searching as well.
> 

This is very helpful.  Thanks again.


-- 
Thomas Dowling

Reply via email to