gf2121 commented on PR #12604: URL: https://github.com/apache/lucene/pull/12604#issuecomment-1741082917
Here is the young GC statistics and allocation profile after indexing`wikimedium10m` (without facets and dvs) <!--StartFragment--><byte-sheet-html-origin data-id="1696000956100" data-version="4" data-is-embed="false" data-grid-line-hidden="false" data-importRangeRawData-spreadSource="https://bytedance.feishu.cn/sheets/G5dwsdvZ7hOxXftyfDkcvUkYnqB" data-importRangeRawData-range="'Sheet1'!A4:D6"> | main | patch | diff -- | -- | -- | -- Time in Young Generation GC | 1245 | 864 | -30.60% Collections | 525 | 321 | -38.86% </byte-sheet-html-origin><!--EndFragment--> **Baseline Allocation Profile** ``` PERCENT HEAP SAMPLES STACK 39.26% 125353M org.apache.lucene.util.fst.BytesStore#writeByte() 5.33% 17008M org.apache.lucene.codecs.lucene90.Lucene90PostingsWriter#newTermState() 5.07% 16188M java.util.Arrays#copyOfRange() 4.33% 13832M java.lang.StringUTF16#compress() 3.97% 12688M java.util.HashMap#newNode() 3.43% 10936M org.apache.lucene.util.ByteBlockPool$DirectTrackingAllocator#getByteBlock() 3.28% 10461M org.apache.lucene.index.FreqProxTermsWriterPerField$FreqProxPostingsArray#<init>() 2.70% 8635M java.util.Arrays#copyOf() 2.60% 8307M org.apache.lucene.index.ParallelPostingsArray#<init>() 2.43% 7743M org.apache.lucene.util.BytesRef#<init>() 2.02% 6435M org.apache.lucene.util.LongHeap#<init>() 1.94% 6202M java.lang.String#<init>() 1.71% 5451M org.apache.lucene.codecs.lucene90.blocktree.Lucene90BlockTreeTermsWriter$PendingTerm#<init>() 1.54% 4905M org.apache.lucene.codecs.lucene90.Lucene90NormsProducer#getNorms() 1.50% 4779M org.apache.lucene.codecs.lucene90.blocktree.Lucene90BlockTreeTermsWriter$TermsWriter#write() 1.26% 4025M java.util.ArrayList#grow() 1.20% 3835M org.apache.lucene.util.TimSorter#<init>() 0.99% 3163M java.util.HashMap#resize() 0.97% 3089M org.apache.lucene.util.fst.FSTCompiler#<init>() 0.77% 2464M org.apache.lucene.util.ArrayUtil#growExact() 0.72% 2292M org.apache.lucene.util.fst.FSTCompiler$UnCompiledNode#<init>() 0.72% 2286M org.apache.lucene.util.fst.FSTEnum#getArc() 0.63% 1998M org.apache.lucene.util.BytesRefHash#rehash() 0.59% 1887M java.util.regex.Matcher#<init>() 0.58% 1848M org.apache.lucene.codecs.lucene90.PForUtil#encode() 0.55% 1758M java.text.CalendarBuilder#<init>() 0.52% 1674M java.text.SimpleDateFormat#subParse() 0.42% 1334M org.apache.lucene.codecs.CompetitiveImpactAccumulator#getCompetitiveFreqNormPairs() 0.39% 1252M java.lang.StringBuffer#toString() 0.39% 1251M java.text.DecimalFormat#parse() ``` **Candidate Allocation Profile** ``` PERCENT HEAP SAMPLES STACK 8.56% 16863M org.apache.lucene.codecs.lucene90.Lucene90PostingsWriter#newTermState() 8.27% 16280M java.util.Arrays#copyOfRange() 6.89% 13577M java.lang.StringUTF16#compress() 6.31% 12423M java.util.HashMap#newNode() 5.59% 11007M org.apache.lucene.index.FreqProxTermsWriterPerField$FreqProxPostingsArray#<init>() 5.40% 10633M org.apache.lucene.util.ByteBlockPool$DirectTrackingAllocator#getByteBlock() 4.39% 8637M org.apache.lucene.index.ParallelPostingsArray#<init>() 4.13% 8129M java.util.Arrays#copyOf() 3.61% 7117M org.apache.lucene.util.BytesRef#<init>() 3.55% 6985M org.apache.lucene.util.LongHeap#<init>() 3.17% 6236M java.lang.String#<init>() 2.84% 5584M org.apache.lucene.codecs.lucene90.blocktree.Lucene90BlockTreeTermsWriter$PendingTerm#<init>() 2.59% 5101M org.apache.lucene.codecs.lucene90.blocktree.Lucene90BlockTreeTermsWriter$TermsWriter#write() 2.52% 4970M org.apache.lucene.codecs.lucene90.Lucene90NormsProducer#getNorms() 2.10% 4131M org.apache.lucene.util.fst.BytesStore#writeByte() 1.92% 3788M org.apache.lucene.util.TimSorter#<init>() 1.72% 3382M java.util.HashMap#resize() 1.54% 3041M org.apache.lucene.util.fst.FSTCompiler#<init>() 1.31% 2574M org.apache.lucene.util.ArrayUtil#growExact() 1.21% 2374M org.apache.lucene.util.fst.FSTCompiler$UnCompiledNode#<init>() 1.04% 2049M org.apache.lucene.util.fst.FSTEnum#getArc() 0.98% 1929M java.text.CalendarBuilder#<init>() 0.98% 1926M java.util.ArrayList#grow() 0.98% 1926M org.apache.lucene.util.BytesRefHash#rehash() 0.85% 1677M org.apache.lucene.codecs.lucene90.PForUtil#encode() 0.84% 1647M java.text.SimpleDateFormat#subParse() 0.78% 1543M java.util.regex.Matcher#<init>() 0.71% 1402M java.text.DecimalFormat#parse() 0.70% 1376M org.apache.lucene.codecs.CompetitiveImpactAccumulator#getCompetitiveFreqNormPairs() 0.68% 1338M sun.util.locale.provider.DateFormatSymbolsProviderImpl#getInstance() ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org