Hello,
Since i replicated my shards (i have 2 cores per shard now), I get a
remarkable decrease in qTime. I assume it happens since my memory has to
split between twice more cores than it used to.

In my low qps rate use-case, I use replications as shard backup only (in
case one of my servers goes down) and not for the ability of serving
parallel requests. In this case i decrease because the two cores of the
shard are active.

I was wondering wether it is possible to query the same core every request,
instead of "load balancing" between the different replicas? And only if the
"leader" replica goes down the second replica would start serving requests.

Cheers,
Manu

Reply via email to