> Good morning, > > recently I went from version 6.4 to version 8.4.1, I access solerre > through java applications written by me to which I have updated the > solr-solrj-8.4.1.jar libraries. > > I am performing the OCR indexing of a newspaper of about 550,000 pages in > production for which I have calculated at least 1,000,000,000 words and I > am experiencing slowness I wanted to know if you could advise me on changes > to the configuration. > > The server I'm using is a server with 12 cores and 64GB of Ram, the only > changes I made in the configuration are: > Solr.in.sh <http://solr.in.sh/> file > SOLR_HEAP="20480m" > SOLR_JAVA_MEM="-Xms20480m -Xmx20480m" > GC_LOG_OPTS="-verbose:gc -XX:+PrintHeapAtGC -XX:+PrintGCDetails \ > -XX:+PrintGCDateStamps -XX:+PrintGCTimeStamps > -XX:+PrintTenuringDistribution -XX:+PrintGCApplicationStoppedTime" > The Java version I use is > java version "1.8.0_51" > Java(TM) SE Runtime Environment (build 1.8.0_51-b16) > Java HotSpot(TM) 64-Bit Server VM (build 25.51-b03, mixed mode) > > Also comparing the solr web interface I noticed a difference in the > "Overview" page in solr 6.4 it was affected Optimized and Current and > allowed me to launch Optimized if necessary, in version 8.41 Optimized is > no longer present I hypothesized that this activity is done with the commit > or through some operation in the backgroup, if this were so, is it still > necessary to run the Optimize command from my application when I have > finished indexing? I noticed that the Optimized function requires > considerable time and resources especially in large databases > > Thank you for your attention
Massimiliano Randazzo > >