gf2121 commented on issue #12598: URL: https://github.com/apache/lucene/issues/12598#issuecomment-1739489111
I get similar statistics for wikimediumall and here are the results when `BytesStore#finish` called 1,000,000 times. ``` BytesStore#finish called: 1000000 times min: 1 mid: 16 avg: 64.555987 pct75: 28 pct90: 57 pct99: 525 pct999: 4957 pct9999: 29124 max: 631700 ``` It seems 1k bytes per block is enough here. 99% cases can be covered by single block and we at most need 600+ blocks for single FST. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org