At a certain level it will become better to have multiple smaller boxes rather than one huge one. I've found that even an old P4 with 2 gigs of ram has decent response time on our 150,000 item index with only a few users - but it quickly goes downhill if we get more than 5 or 6. How many documents are you going to be storing in your index? How much of them will be "stored" versus "indexed"? Will you be faceting on the results?

In general, I'd recommend a 64 bit processor with enough ram to store your index in ram - but that might not be possible with "millions" of records. Our 150,000 item index is about a gig and a half when optimized but yours will likely be different depending on how much you store. Faceting takes more memory than pure searching as well.

I'm sure that we could work out some better suggestions with more information about your use case.

http://www.nabble.com/Solr---User-f14480.html is a great place to go for searching the solr user list.

-Matthew

On Jan 21, 2009, at 8:55 AM, Thomas Dowling wrote:

Is there a useful guide somewhere that suggests system configurations
for machines that will support multiple large-ish Solr indexes?  I'm
working on a group of library databases (journal article citations +
abstracts, mostly), and need to provide some sort of helpful information
to our hardware people.  Other than "lots", is there an answer for "We
have X millions of records, of Y average size, with Z peak simultaneous users, so the memory needed for reasonable search performance is _____"? Or is the limiting factor on search performance going to be something else?

[Standard caveat: I did try checking the solr-user archives, but was
hampered by the fact that there's no search function.  The cobbler's
children go barefoot.]


--
Thomas Dowling
Ohio Library and Information Network
tdowl...@ohiolink.edu


Reply via email to