FYI, I searched Google for this problem but didn't find a satisfactory
answer.

Here is the current situation: I have 8 shards in my SolrCloud cluster,
backed by 3 ZooKeeper nodes, all set up on AWS EC2 instances. All 8 shards
are leaders with no replicas. I have only 1 collection, say collection1,
divided into 8 shards. I have configured the index and tlog folders on each
server to point to a 1 TB EBS disk attached to that server. Each of the 8
servers has around 100 GB in its index folder, so the total index size is
~800 GB.

Now, I want to move
all the data to HDFS, so I am going to:

1. Set up HDFS on all 8 servers.
2. Merge all the indexes from the 8 servers.
3. Put the merged index in HDFS.
4. Stop and start all my Solr servers on HDFS so they access that common
   index data, setting the parameters below and a few more:

-Dsolr.directoryFactory=HdfsDirectoryFactory -Dsolr.lock.type=hdfs
-Dsolr.data.dir=hdfs://host:port/path
-Dsolr.updatelog=hdfs://host:port/path -jar

Now, could you tell me whether this is the correct approach? If yes, how can
I merge all the indices from the 8 servers?

Regards,
Amey
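
P.S. To make the restart part of the plan concrete, here is a minimal sketch of how I would assemble the HDFS-related JVM properties for each server. The function name solr_hdfs_props and its hdfs_uri argument are names I made up for illustration, and hdfs://host:port/path is only a placeholder, not a real address. It just builds the same four properties listed above; it does not start Solr.

```python
def solr_hdfs_props(hdfs_uri):
    """Return the HDFS-related JVM system properties from the plan above.

    hdfs_uri is a placeholder such as "hdfs://host:port/path"; the same
    value is used for the data dir and the update log, as in my message.
    """
    return [
        "-Dsolr.directoryFactory=HdfsDirectoryFactory",
        "-Dsolr.lock.type=hdfs",
        "-Dsolr.data.dir=" + hdfs_uri,
        "-Dsolr.updatelog=" + hdfs_uri,
    ]


if __name__ == "__main__":
    # Print the properties one per line so they can be checked by eye.
    for prop in solr_hdfs_props("hdfs://host:port/path"):
        print(prop)
```

If the data dir and the update log should live under different paths, the function would need a second argument; I kept them identical only because that is how I wrote them above.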