I _think_ this is a distinction between serving the query and processing the results. The query itself is standard Solr processing, returning results from one replica per shard.
Those results can then be partitioned out to N Solr instances for sub-processing, where N is the number of worker nodes you specified. Those workers may or may not host any replicas of that collection.

At least I think that's what's up, but then again this is new to me too. Which bits of the doc, anyway? Sounds like some clarification is in order.

Best,
Erick

On Mon, May 23, 2016 at 9:32 AM, Timothy Potter <thelabd...@gmail.com> wrote:
> I've seen docs and diagrams that seem to indicate a streaming
> expression can utilize all replicas of a shard but I'm seeing only 1
> replica per shard (I have 2) being queried.
>
> All replicas are on the same host for my experimentation, could that
> be the issue? What are the circumstances where all replicas will be
> utilized?
>
> Or is this a mis-understanding of the docs?
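
For illustration only, the two phases described in the reply above (query one replica per shard, then fan the results out to N workers) can be sketched in plain Python. This is a toy model of that reading, not Solr code: the function names, the replica names, the "pick the first replica" rule, and the CRC32-modulo partitioning are all assumptions made for the sketch, and Solr's actual replica selection and hashing may well differ.

```python
import zlib


def pick_replica_per_shard(shards):
    """Phase 1 (query): choose exactly one replica for each shard.

    Assumption: we simply take the first replica listed; a real cluster
    would choose differently.
    """
    return {shard: replicas[0] for shard, replicas in shards.items()}


def partition_tuples(tuples, key, num_workers):
    """Phase 2 (processing): assign each result tuple to one worker.

    Assumption: the worker is chosen by CRC32 of the key field modulo the
    number of workers, so equal keys always land on the same worker.
    """
    buckets = [[] for _ in range(num_workers)]
    for t in tuples:
        h = zlib.crc32(str(t[key]).encode("utf-8"))
        buckets[h % num_workers].append(t)
    return buckets


# Hypothetical layout matching the question: 2 replicas per shard.
shards = {
    "shard1": ["replica1a", "replica1b"],
    "shard2": ["replica2a", "replica2b"],
}

queried = pick_replica_per_shard(shards)      # only one replica per shard is hit
results = [{"id": i} for i in range(10)]      # stand-in for the merged query results
buckets = partition_tuples(results, "id", 3)  # N = 3 workers, unrelated to replica count
```

Under this model the number of replicas queried depends only on the shard count, while the worker count is a separate setting, which would be consistent with seeing only 1 replica per shard being queried.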