[ 
https://issues.apache.org/jira/browse/LUCENE-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418074#comment-17418074
 ] 

Adrien Grand commented on LUCENE-10031:
---------------------------------------

This change decreased overall merge times by 16% in the sorted case for the NYC 
Taxis benchmark: 
http://people.apache.org/~mikemccand/lucenebench/sparseResults.html#tot_merge_times.

> Speedup to SortedDocIDMerger when sorting on low-cardinality fields
> -------------------------------------------------------------------
>
>                 Key: LUCENE-10031
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10031
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>             Fix For: 8.10
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> I've been looking at profiles of indexing with index sorting enabled and saw 
> non-negligible time spent in SortedDocIDMerger. This isn't completely 
> surprising as this little class is called on every document whenever merging 
> postings, doc values, stored fields, etc.
> I'm especially interested in cases when the sort key is a low-cardinality 
> field, so the priority queue doesn't get reordered often. I've been playing 
> with a change to SortedDocIDMerger that makes merging significantly faster in 
> that case.
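
To illustrate why a low-cardinality sort key helps, here is a minimal sketch of a k-way merge that touches the priority queue only when the run currently being consumed stops being the smallest. It is not the Lucene code and not necessarily the approach taken in the patch: the function name `merge_sorted_runs`, the use of plain integer sort keys, and the `heap_updates` counter are all assumptions made for this example.

```python
import heapq
from typing import Iterator, List, Sequence, Tuple


def merge_sorted_runs(runs: Sequence[Sequence[int]]) -> Tuple[List[int], int]:
    """Merge ascending runs of sort keys.

    Returns the merged keys and the number of heap operations performed,
    so the effect of long stretches of equal keys is visible.
    Hypothetical helper, for illustration only.
    """
    iters: List[Iterator[int]] = [iter(run) for run in runs]

    # Seed the heap with the first key of every non-empty run.
    # Entries are (key, run index); the index breaks ties deterministically.
    heap = []
    for idx, it in enumerate(iters):
        first = next(it, None)
        if first is not None:
            heap.append((first, idx))
    heapq.heapify(heap)

    merged: List[int] = []
    heap_updates = 0
    while heap:
        # Take the smallest run out of the heap and keep it as "current".
        value, idx = heapq.heappop(heap)
        heap_updates += 1
        while True:
            merged.append(value)
            nxt = next(iters[idx], None)
            if nxt is None:
                break  # current run is exhausted
            if heap and (nxt, idx) > heap[0]:
                # Another run is now smaller: put current back and switch.
                heapq.heappush(heap, (nxt, idx))
                heap_updates += 1
                break
            # Current run is still the smallest: no heap work needed.
            value = nxt
    return merged, heap_updates
```

With few distinct keys, each run stays "current" for many consecutive entries, so the number of heap operations grows with the number of key changes rather than with the number of documents. A naive merge would instead pop and push once per document.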



