Re: Optimize solr 8.4.1

2020-02-26 Thread Erick Erickson
As long as you have an HTTP connection, you can use the replication API's fetchindex command to, well, fetch an index. That copies the index but does not shard it. I guess you could fetch into a single-shard collection and then use SPLITSHARD. All that said, you'll have to reindex sometime if yo
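
The two steps mentioned above (fetchindex into a single-shard collection, then split the shard) could be sketched roughly as follows. This is only an illustration that builds the request URLs and sends nothing. The host names, core name and collection name in the usage note are hypothetical. `masterUrl` is the parameter name used by the replication handler in the 8.x line, and passing the source core's URL as its value is an assumption to check against the reference guide for your version.

```python
from urllib.parse import urlencode


def fetchindex_url(target_core_url, source_core_url):
    """Build the replication-handler URL that asks the target core
    to pull the index from the source core (nothing is sent here)."""
    query = urlencode({
        "command": "fetchindex",
        # Assumption: Solr 8.x parameter name; value is the source core URL.
        "masterUrl": source_core_url,
    })
    return f"{target_core_url}/replication?{query}"


def splitshard_url(solr_base_url, collection, shard):
    """Build the Collections API URL that splits one shard of a collection."""
    query = urlencode({
        "action": "SPLITSHARD",
        "collection": collection,
        "shard": shard,
    })
    return f"{solr_base_url}/admin/collections?{query}"
```

For example, `fetchindex_url("http://cloudhost:8983/solr/pages_shard1_replica_n1", "http://oldhost:8983/solr/pages")` followed by `splitshard_url("http://cloudhost:8983/solr", "pages", "shard1")` gives the two URLs to request in that order, once the fetch has finished.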

Re: Optimize solr 8.4.1

2020-02-26 Thread Dario Rigolin
Hi Massimiliano, the only way to reindex is to resend all documents to the indexer of the Cloud instance. At the moment Solr cannot do this by itself when the schema is changed, nor can it "send" indexed data to a SolrCloud from a non-cloud instance. For example, we have in Solr a field with an only sto

Re: Optimize solr 8.4.1

2020-02-26 Thread Massimiliano Randazzo
Hi Paras, thank you for your answer. If you don't mind, I have a couple of questions. I am experiencing very long indexing times. I currently have 8 servers working on 1 instance of Solr. I thought of moving to a cloud of 4 Solr servers with 3 ZooKeeper servers to distribute the load but I

Re: Optimize solr 8.4.1

2020-02-26 Thread Paras Lehana
Hi Massimiliano, > Is it still necessary to run the Optimize command from my application when I have finished indexing? I guess you can stop worrying about optimizations and let Solr handle that implicitly. There's nothing so bad about having more segments. On Wed, 26 Feb 2020 at 16:02, Massimi

Optimize solr 8.4.1

2020-02-26 Thread Massimiliano Randazzo
Good morning, I recently went from version 6.4 to version 8.4.1. I access Solr through Java applications that I wrote myself, for which I have updated the libraries to solr-solrj-8.4.1.jar. I am performing the OCR indexing of a newspaper of about 550,000 pages in production for which I have