Beyond a certain point it becomes better to have multiple smaller
boxes rather than one huge one. I've found that even an old P4 with 2
gigs of RAM has decent response time on our 150,000 item index with
only a few users - but it quickly goes downhill if we get more than 5
or 6 users. How many documents are you going to be storing in your
index? How many of your fields will be "stored" versus "indexed"? Will
you be faceting on the results?
In general, I'd recommend a 64-bit processor with enough RAM to hold
your index in memory - but that might not be possible with "millions"
of records. Our 150,000 item index is about a gig and a half when
optimized, but yours will likely be different depending on how much you
store. Faceting takes more memory than pure searching as well.
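As a rough back-of-the-envelope illustration of those numbers, here is a minimal Python sketch that scales our 150,000 item / 1.5 GB figure up to larger record counts. It assumes index size grows linearly with document count and that your documents look like ours, neither of which is guaranteed. The function name and its parameters are made up for this example, and it says nothing about the extra memory that faceting or caching would need.

```python
def estimate_index_size_gb(num_docs, ref_docs=150_000, ref_size_gb=1.5):
    """Linearly extrapolate on-disk index size from a reference index.

    Assumption: size scales in proportion to document count. Real indexes
    vary with how many fields are stored versus only indexed.
    """
    if ref_docs <= 0:
        raise ValueError("ref_docs must be positive")
    return num_docs * (ref_size_gb / ref_docs)


if __name__ == "__main__":
    # Print rough estimates for a few hypothetical collection sizes.
    for docs in (150_000, 1_000_000, 5_000_000):
        size = estimate_index_size_gb(docs)
        print(f"{docs:>9,} docs -> ~{size:.1f} GB")
```

By this estimate a million records comes out to roughly 10 GB, which is why holding the whole index in RAM may stop being practical on a single box. Treat the result as a starting point for a conversation with your hardware people, not as a measurement.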
I'm sure that we could work out some better suggestions with more
information about your use case.
http://www.nabble.com/Solr---User-f14480.html is a great place to go
for searching the Solr user list.
-Matthew
On Jan 21, 2009, at 8:55 AM, Thomas Dowling wrote:
Is there a useful guide somewhere that suggests system configurations
for machines that will support multiple large-ish Solr indexes? I'm
working on a group of library databases (journal article citations +
abstracts, mostly), and need to provide some sort of helpful information
to our hardware people. Other than "lots", is there an answer for "We
have X millions of records, of Y average size, with Z peak simultaneous
users, so the memory needed for reasonable search performance is
_____"? Or is the limiting factor on search performance going to be
something else?
[Standard caveat: I did try checking the solr-user archives, but was
hampered by the fact that there's no search function. The cobbler's
children go barefoot.]
--
Thomas Dowling
Ohio Library and Information Network
tdowl...@ohiolink.edu