[ https://issues.apache.org/jira/browse/LUCENE-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17389728#comment-17389728 ]
ASF subversion and git services commented on LUCENE-10031: ---------------------------------------------------------- Commit 42464888c6b42b56b2d20ceb8e65fb08e600f5aa in lucene-solr's branch refs/heads/branch_8x from Adrien Grand [ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=4246488 ] LUCENE-10031: Speed up SortedDocIdMerger on low-cardinality sort fields. (#221) When sorting by low-cardinality fields, the same sub remains current for long sequences of doc IDs. This speeds up SortedDocIdMerger a bit by extracting the sub that leads iteration. > Speedup to SortedDocIDMerger when sorting on low-cardinality fields > ------------------------------------------------------------------- > > Key: LUCENE-10031 > URL: https://issues.apache.org/jira/browse/LUCENE-10031 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Adrien Grand > Priority: Minor > Time Spent: 1h > Remaining Estimate: 0h > > I've been looking at profiles of indexing with index sorting enabled and saw > non-negligible time spent in SortedDocIDMerger. This isn't completely > surprising as this little class is called on every document whenever merging > postings, doc values, stored fields, etc. > I'm especially interested in cases when the sort key is on a low cardinality > field, so the priority queue doesn't get reordered often. I've been playing > with a change to SortedDocIdMerger that makes merging significantly faster in > that case. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org