On Tue, Jun 7, 2011 at 9:35 AM, Jamie Johnson <jej2...@gmail.com> wrote: > I am currently experimenting with the Solr Cloud code on trunk and just had > a quick question. Lets say my setup had 3 nodes a, b and c. Node a has > 1000 results which meet a particular query, b has 2000 and c has 3000. When > executing this query and asking for row 900 what specifically happens? From > reading the Distributed Search Wiki I would expect that node a responds with > 900, node b responds with 900 and c responds with 900 and the coordinating > node is responsible for taking the top scored items and throwing away the > rest, is this correct or is there some additional coordination that happens > where nodes a, b and c return back an id and a score and the coordinating > node makes an additional request to get back the documents for the ids which > make up the top list?
The latter is correct - the first phase only collects enough information to merge ids from the shards, and then a second phase requests the stored fields, highlighting, etc for the specific docs that will be returned. -Yonik http://www.lucidimagination.com