Hello all, We're running Solr 7.3.1 on Docker, trying to save the indexing information on Ceph Storage using HDFS + Hadoop-AWS S3A filesystem client. Currently, we start 2 Solr instances and 3 Zookeepers.
When Solr is started, we create a test collection with 2 shards and a replication factor of 2. Everything works fine and the Ceph Buckets are populated correctly. We can see files in Ceph like: testcollection/core_node8/data/index/_0.fdt 111 2018-08-01T14:45:18.038Z testcollection/core_node8/data/index/_0.fdx 83 2018-08-01T14:45:16.604Z testcollection/core_node8/data/index/_0.fnm 427 2018-08-01T14:45:22.738Z However, when we restart one of the containers, the recovery process apparently duplicates the "dataDir" configuration, and we start to see additional files like: testcollection/core_node7/s3a:/bucketname/testcollection/core_node8/data/index/_0.fdt 111 2018-08-01T14:54:39.361Z testcollection/core_node7/s3a:/bucketname/testcollection/core_node8/data/index/_0.fdx 83 2018-08-01T14:54:32.669Z testcollection/core_node7/s3a:/bucketname/testcollection/core_node8/data/index/_0.fnm 427 2018-08-01T14:54:58.761Z Where "s3a:/bucketname" is the "solr.hdfs.home" value configured in solr.in.sh. We also noticed that before the restart, the core.properties file does not have the "dataDir" property configured. After the restart, the container has this property defined as "s3a:/bucketname/testcollection/core_node8/data". Is this behaviour correct, even if the index files are being duplicated again and again in every restart? What could be causing this? Thanks for your help, Joaquim Oliveira - "Esta mensagem do SERVIÇO FEDERAL DE PROCESSAMENTO DE DADOS (SERPRO), empresa pública federal regida pelo disposto na Lei Federal nº 5.615, é enviada exclusivamente a seu destinatário e pode conter informações confidenciais, protegidas por sigilo profissional. Sua utilização desautorizada é ilegal e sujeita o infrator às penas da lei. Se você a recebeu indevidamente, queira, por gentileza, reenviá-la ao emitente, esclarecendo o equívoco." "This message from SERVIÇO FEDERAL DE PROCESSAMENTO DE DADOS (SERPRO) -- a government company established under Brazilian law (5.615/70) -- is directed exclusively to its addressee and may contain confidential data, protected under professional secrecy rules. Its unauthorized use is illegal and may subject the transgressor to the law's penalties. If you're not the addressee, please send it back, elucidating the failure."