[ 
https://issues.apache.org/jira/browse/LUCENE-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17539976#comment-17539976
 ] 

Adrien Grand commented on LUCENE-10581:
---------------------------------------

StoredFieldsBenchmark on the entire geonames dataset gives me

|| Msec to index || BEST_SPEED || BEST_COMPRESSION ||
| main | 105726 | 248489 |
| patch | 91000 | 200811 |

> Optimize stored fields merges on the first segment
> --------------------------------------------------
>
>                 Key: LUCENE-10581
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10581
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>
> This is mostly repurposing LUCENE-10573. Even though our merge policies no 
> longer perform quadratic merging, it's still possible to configure them with 
> low merge factors (e.g. 2) or they might decide to create unbalanced merges 
> where the biggest segment of the merge accounts for a large part of the 
> merge. In such cases, copying compressed data directly still yields 
> significant benefits.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to