[GitHub] [iceberg] findepi opened a new issue, #6443: Provide Puffin reader API allowing read without decompression

GitBox Fri, 16 Dec 2022 07:44:21 -0800


findepi opened a new issue, #6443:
URL: https://github.com/apache/iceberg/issues/6443


   ### Feature Request / Improvement
   
   When a query engine wants to add new stats to a snapshot that already has 
some stats, it currently needs to merge existing stats file' blobs with new 
ones.  
   
   Currently, the only Puffin reader API for reading blobs will decompress them 
implicitly.
   The application merging stats probably doesn't know much about these old 
stats, so also doesn't know whether they should be compressed, so it should 
preserve the compression. Thus it will want to re-compress them again.
   
   - This process is wasteful: redundant decompression and compression
   - Also, it is not possible to implement it in a future-proof manner: 
application can preserve compression only for the puffin codecs it was built 
with
   
   ### Query engine
   
   None


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

[GitHub] [iceberg] findepi opened a new issue, #6443: Provide Puffin reader API allowing read without decompression

Reply via email to