Good morning, recently I went from version 6.4 to version 8.4.1, I access solerre through java applications written by me to which I have updated the solr-solrj-8.4.1.jar libraries.
I am performing the OCR indexing of a newspaper of about 550,000 pages in production for which I have calculated at least 1,000,000,000 words and I am experiencing slowness I wanted to know if you could advise me on changes to the configuration. The server I'm using is a server with 12 cores and 64GB of Ram, the only changes I made in the configuration are: Solr.in.sh <http://solr.in.sh/> file SOLR_HEAP="20480m" SOLR_JAVA_MEM="-Xms20480m -Xmx20480m" GC_LOG_OPTS="-verbose:gc -XX:+PrintHeapAtGC -XX:+PrintGCDetails \ -XX:+PrintGCDateStamps -XX:+PrintGCTimeStamps -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime" The Java version I use is java version "1.8.0_51" Java(TM) SE Runtime Environment (build 1.8.0_51-b16) Java HotSpot(TM) 64-Bit Server VM (build 25.51-b03, mixed mode) Also comparing the solr web interface I noticed a difference in the "Overview" page in solr 6.4 it was affected Optimized and Current and allowed me to launch Optimized if necessary, in version 8.41 Optimized is no longer present I hypothesized that this activity is done with the commit or through some operation in the backgroup, if this were so, is it still necessary to run the Optimize command from my application when I have finished indexing? I noticed that the Optimized function requires considerable time and resources especially in large databases Thank you for your attention-- Inviato da Gmail Mobile