kkewwei commented on issue #14347: URL: https://github.com/apache/lucene/issues/14347#issuecomment-2743128817
`I` am wondering if the default for BEST_SPEED should be using preset dict as that compromises speed for compression` It appears that certain scenarios might not be entirely appropriate for LZ4WithPresetDict. In principle, the read I/O may does indeed increase by double, particularly for small documents. For instance, if the size of small document is 1kb, while the dictionary is 4 KB, the read I/O will increase fourfold. I'm attempting to implement dictionary reuse by current buffer. it's lossless. It functions effectively when reading some documents from one chunk(8*10*1024). However, if the read document from different chunk, we have to rebuild the dictionary. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org