Take a look at the SPLITSHARD Collections API here:
https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api3

Best value of numShards and replicationFactor: Impossible to say. You have
to stress test respecting your SLAs. See:
https://lucidworks.com/blog/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/

But there's _no_ reason to split your shard if you are getting adequate
response times to queries. In fact, going to more than one shard
will possibly slow your query response as distributed queries add
inevitable overhead.

If you simply want to add more replicas to increase the QPS rate you
can handle. just bring up your Solr nodes and use the Collections API
ADDREPLICA command to your single shard.

Best,
Erick

On Mon, May 23, 2016 at 7:52 AM, Scott Chu <scott....@udngroup.com> wrote:
> I just created a 90gb index collection with 1 shard and 2 replicas on 2 
> nodes. I am to migrate from 2 nodes to 4 node. I am wondering what's the best 
> stragedy to split this single shard? Furthermore, If I am ok to reindex, 
> what's the best adequate experienced value of numShards and 
> replicationFactor? Lastly, I think there's no other way but reindex if I want 
> my data to be evenly distributed into every shard I create, right?
>
> Scott Chu,scott....@udngroup.com
> 2016/5/23 (週一)
>
> P.S. For those who are curious of why I add [scottchu] in subject, the reason 
> is that I want my email filter to route those emails that answer to my 
> question to specific folder.

Reply via email to