findepi opened a new issue, #6443: URL: https://github.com/apache/iceberg/issues/6443
### Feature Request / Improvement

When a query engine wants to add new stats to a snapshot that already has some stats, it currently has to merge the blobs of the existing stats file with the new ones. The only Puffin reader API for reading blobs decompresses them implicitly. The application merging the stats probably knows little about the old blobs, including whether they should be compressed, so it should preserve their existing compression. To do that, it has to compress them again after reading.

- This process is wasteful: it performs a redundant decompression and compression.
- It also cannot be implemented in a future-proof manner: the application can preserve compression only for the Puffin codecs it was built with.

### Query engine

None
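To illustrate the two problems above, here is a toy sketch of the merge step. It is not the Iceberg Puffin API: `StoredBlob`, `KNOWN_CODECS`, `merge_with_recompression`, and `merge_with_passthrough` are hypothetical names, and `zlib` from the Python standard library stands in for the real Puffin codecs. It only assumes that a blob is stored as a codec name plus the bytes as written.

```python
import zlib
from dataclasses import dataclass
from typing import List, Optional


# Toy model only: hypothetical names, not the Iceberg Puffin API.
@dataclass(frozen=True)
class StoredBlob:
    blob_type: str
    codec: Optional[str]  # None means the payload is stored uncompressed
    payload: bytes        # bytes exactly as stored in the stats file


# Codecs this hypothetical application was built with (zlib is a stand-in).
KNOWN_CODECS = {"zlib": (zlib.compress, zlib.decompress)}


def merge_with_recompression(old: List[StoredBlob],
                             new: List[StoredBlob]) -> List[StoredBlob]:
    """Models the current situation: old blobs are decoded, then re-encoded."""
    merged = []
    for blob in old:
        if blob.codec is None:
            merged.append(blob)
            continue
        if blob.codec not in KNOWN_CODECS:
            # The application cannot preserve a codec it does not know.
            raise ValueError(f"unsupported codec: {blob.codec}")
        compress, decompress = KNOWN_CODECS[blob.codec]
        raw = decompress(blob.payload)  # redundant work
        merged.append(StoredBlob(blob.blob_type, blob.codec, compress(raw)))
    return merged + list(new)


def merge_with_passthrough(old: List[StoredBlob],
                           new: List[StoredBlob]) -> List[StoredBlob]:
    """Models the requested behavior: stored bytes and codec are copied as-is."""
    return list(old) + list(new)
```

In this model, `merge_with_passthrough` never touches a payload, so it also works for a blob whose codec name is missing from `KNOWN_CODECS`, while `merge_with_recompression` raises `ValueError` for the same input.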