: ?I have a SolrCloud cluster with two servers and I created a collection using 
two shards with this command:
        ...
: There were 230 documents in the set I indexed and there were 3 different 
prefixes (RM!, WW! and BH!) but all were routed into the same shard. Is there 
anything I can do to debug this further?

I'm not really a math expert but...

If you have N (2) shards, and a single prefix ("RM") there is a 100% 
chance that that prefix will hash into 1 of those N=2 shards.

For a 2nd prefix ("WW") there is a 1/N (1/2) chance that it will hash into 
the same shard as your first prefix ("RM").

Likewise, there is a 1/N (1/2) chance that any other prefix ("BH") will 
hash into the same hard as your first prefix ("RM").

Which means there is a 25% (1/2 * 1/2 = 1/4) chance tha 3 randomly 
selected prefixes will all hash to the same shard.

(In general, if you have N shards, and P # of unique prefixes, then the 
odds that they all wind up in the same shard is going to be: 
"(1/N)**(P-1)")

So i suspect you just go unlucky with the 3 prefixes you happen to try in 
your small test.






-Hoss
http://www.lucidworks.com/

Reply via email to