gf2121 commented on issue #14803: URL: https://github.com/apache/lucene/issues/14803#issuecomment-2987035325
OLAP engines split the format into `codec` and `compression`, both configurable. For example, you can:

* Use the `ForUtil` codec with `LZ4` compression on a normal filesystem, with the cache managed by the engine.
* Use the `ForUtil` codec with `None` compression on a compressed filesystem, with the cache managed by the operating system.

I guess the problem here is that Lucene wraps everything in its `Codec`, so some structures are compressed twice on a compressed filesystem, while others expand a lot on a normal filesystem. It also seems difficult for users to realize how much a compressed filesystem affects Lucene numeric doc values. This perhaps leaves too much responsibility on the user side.

My feeling is still that we have something to do here, but I don't really know how to do it correctly. Let's keep it :)
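To make the codec/compression split concrete, here is a minimal sketch of the two layers as independently configurable steps. Everything in it is hypothetical and for illustration only: the class `LayeredFormatSketch`, the `Encoding` and `Compression` enums, and the naive bit packing are not Lucene APIs and are not how `ForUtil` works internally. The JDK's `Deflater` stands in for `LZ4`, which is not part of the standard library.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;
import java.util.Random;
import java.util.zip.Deflater;

public class LayeredFormatSketch {
    // Hypothetical knobs: how values are encoded, and whether bytes are compressed afterwards.
    enum Encoding { RAW, BIT_PACKED }
    enum Compression { NONE, DEFLATE }

    // Layer 1: turn longs into bytes. BIT_PACKED assumes non-negative values.
    static byte[] encode(long[] values, Encoding encoding) {
        if (encoding == Encoding.RAW) {
            ByteBuffer buf = ByteBuffer.allocate(values.length * Long.BYTES);
            for (long v : values) buf.putLong(v);
            return buf.array();
        }
        long max = 0;
        for (long v : values) max |= v;
        int bits = Math.max(1, 64 - Long.numberOfLeadingZeros(max));
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(bits); // one-byte header: bits per value
        int current = 0, used = 0;
        for (long v : values) {
            for (int i = bits - 1; i >= 0; i--) {
                current = (current << 1) | (int) ((v >>> i) & 1);
                if (++used == 8) { out.write(current); current = 0; used = 0; }
            }
        }
        if (used > 0) out.write(current << (8 - used)); // pad the last byte with zeros
        return out.toByteArray();
    }

    // Layer 2: optional general-purpose compression of the encoded bytes.
    static byte[] compress(byte[] data, Compression compression) {
        if (compression == Compression.NONE) return data;
        Deflater deflater = new Deflater();
        deflater.setInput(data);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        while (!deflater.finished()) {
            int n = deflater.deflate(chunk);
            out.write(chunk, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    static byte[] write(long[] values, Encoding encoding, Compression compression) {
        return compress(encode(values, encoding), compression);
    }

    public static void main(String[] args) {
        // Small-range values, similar in spirit to low-cardinality numeric doc values.
        Random random = new Random(42);
        long[] values = new long[10_000];
        for (int i = 0; i < values.length; i++) values[i] = random.nextInt(1024);

        for (Encoding e : Encoding.values()) {
            for (Compression c : Compression.values()) {
                System.out.println(e + " + " + c + " -> " + write(values, e, c).length + " bytes");
            }
        }
    }
}
```

Running `main` prints the size of each of the four combinations, which is one way to see how much a second compression layer adds on top of already bit-packed data compared with raw values. With the two layers separated like this, the choice of `Compression` could in principle follow the filesystem, while the `Encoding` stays the same.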