Definitely you will want to have more than one box for your index. You can take a look at distributed search and multicore ate the wiki.
2009/1/21 Thomas Dowling <tdowl...@ohiolink.edu> > On 01/21/2009 12:25 PM, Matthew Runo wrote: > > At a certain level it will become better to have multiple smaller boxes > > rather than one huge one. I've found that even an old P4 with 2 gigs of > > ram has decent response time on our 150,000 item index with only a few > > users - but it quickly goes downhill if we get more than 5 or 6. How > > many documents are you going to be storing in your index? How much of > > them will be "stored" versus "indexed"? Will you be faceting on the > > results? > > Thanks for the tip on multiple boxes. We'll be hosting about 20 > databases total. A couple of them are in the 10- to 20-million record > range and a couple more are in the 5- to 10-million range. It's highly > structured data and I anticipate a lot of faceting and indexing almost > all the fields. > > > > > In general, I'd recommend a 64 bit processor with enough ram to store > > your index in ram - but that might not be possible with "millions" of > > records. Our 150,000 item index is about a gig and a half when optimized > > but yours will likely be different depending on how much you store. > > Faceting takes more memory than pure searching as well. > > > > This is very helpful. Thanks again. > > > -- > Thomas Dowling > -- Alexander Ramos Jardim