> Maybe I got this wrong...but isn't this what mapreduce is meant to deal with?
> eg,
>
> 1) get the job (a query)
> 2) map it to workers ( servers that provide search results from their own
> indexing)
> 3) wait for the results from all workers that reply within acceptable 
> timeframe.
> 4) comb through the lot of  results from all workers, reduce them according to
> your own biz rules (eg, remove dupes, sort them by quality / priority... here 
> possibly relying on the original parameters of the query in 1)
> 5) return the reduced results to the frontend.

That seems to be how Sphinx works:

http://www.sphinxsearch.com/doc.html#distributed

Of course, the details of this are far over my head for either system,
so I don't really know if that's a sensible way of doing things or
not.

Ciao,
-- 
David N. Welton
http://www.welton.it/davidw/

Reply via email to