Hi, I am indexing about 5M docs in a 4 shard and 1 replica setup. During indexing one of the shards is marked as down in zookeeper but when i tail the logs all the updates are received in the shard and a hard commit at the end of the job also succeeds. (The auto commit is set to trigger every 10 mins or 150K documents). The shard does not recover until i force restart solr on that node. The mem/cpu/load on solr is very less during this time.
How and when does solr try to reconnect to zk? Thanks, Nitin