Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
le that enu will be routed to shard1 while deu goes > to shard2, and esp and chs gets indexed in either of them. Or, all of them > can potentially end up getting indexed in the same shard, either 1 or 2, > leaving one shard under-utilized. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-Cloud-sharding-strategy-tp4262274p4262336.html > Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread shamik
ble that enu will be routed to shard1 while deu goes to shard2, and esp and chs gets indexed in either of them. Or, all of them can potentially end up getting indexed in the same shard, either 1 or 2, leaving one shard under-utilized. -- View this message in context: http://lucene.472066.n3.n

Re: Solr Cloud sharding strategy

2016-03-07 Thread Walter Underwood
n grow up to half a TB from its current state. >> Honestly, my perception of "big" index is still vague :-) . All I'm trying >> to make sure is that decision I take is scalable in the long term and will >> be able to sustain the growth without compromising the performance. >> >> >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Solr-Cloud-sharding-strategy-tp4262274p4262304.html >> Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
Honestly, my perception of "big" index is still vague :-) . All I'm trying > to make sure is that decision I take is scalable in the long term and will > be able to sustain the growth without compromising the performance. > > > > -- > View this message in context:

Re: Solr Cloud sharding strategy

2016-03-07 Thread shamik
472066.n3.nabble.com/Solr-Cloud-sharding-strategy-tp4262274p4262304.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Cloud sharding strategy

2016-03-07 Thread Erick Erickson
20M docs is actually a very small collection by the "usual" Solr standards unless they're _really_ large documents, i.e. large books. Actually, I wouldn't even shard to begin with, it's unlikely that it's necessary and it adds inevitable overhead. If you _must_ shard, just go with <1>, but again I

Solr Cloud sharding strategy

2016-03-07 Thread Shamik Bandopadhyay
Hi, I'm trying to figure the best way to design/allocate shards for our Solr Cloud environment.Our current index has around 20 million documents, in 10 languages. Around 25-30% of the content is in English. Rest are almost equally distributed among the remaining 13 languages. Till now, we had to