On 12/5/2018 5:14 AM, ashoknix wrote:
curl -v
'http://localhost:8980/solr/admin/collections?action=SPLITSHARD&collection=billdocs&shard=shard1&async=2000'
<lst name="responseHeader"><int name="status">500</int><int
name="QTime">300009</int></lst><lst name="error"><str name="msg">splitshard
the collection time out:300s</str><str
name="trace">org.apache.solr.common.SolrException: splitshard the collection
time out:300s
<snip>
1. Currently index size is around 40GB.
2. Right now it has single shard - we observe query times high.
3. Does SPLITSHARD helps here with query times? Since docs gets
distributed
You're trying to make the call async. This is a good idea... but async
capability for the collections API was added in Solr 4.8.
https://issues.apache.org/jira/browse/SOLR-5477
Which means that in version 4.4, any collections API action that takes
longer than your collections API timeout is going to return this error.
Your timeout appears to be 300 seconds. I do not know whether the
splitshard will continue to operate on the server in this situation or not.
Once you have successfully split your index, the following will apply:
Increasing the shard count will increase the amount of work that Solr
must do to execute a query. If your query rate is very low and your
system has idle CPUs, then the query might complete faster. If your
query rate is high or you do not have idle CPUs, then splitting shards
will make your queries take longer.
Because the latest version of Solr is 7.5.0, I would not recommend
running any 4.x version. There is zero possibility of bugs in 4.x
getting developer attention. Bugs in 6.6.x MIGHT get attention, but
mostly only bugs in the current major release will be addressed.
Thanks,
Shawn