> Maybe I got this wrong...but isn't this what mapreduce is meant to deal with?
> eg,
>
> 1) get the job (a query)
> 2) map it to workers ( servers that provide search results from their own
> indexing)
> 3) wait for the results from all workers that reply within acceptable
> timeframe.
> 4) comb
Hi,
I'm in the process of evaluating solr and sphinx, and have come to
realize that actually having a large data set to run them against
would be handy. However, I'm pretty new to both systems, so thought
that perhaps asking around my produce something useful.
What *I* mean by largish is somethi