hantangwangd commented on issue #10982:
URL: https://github.com/apache/iceberg/issues/10982#issuecomment-2301489759

   As I understand it, we should not directly clean up a file referenced by a 
DELETE entry of the expired snapshot. It can only be cleaned up safely when 
one of the following conditions is met:
   
    - No snapshots older than the expired one remain
    - No snapshots exist between the one that added the file and the expired one
   
   We can use the DELETE entry's file sequence number to identify the 
snapshot that added this file to the table. If this information is unavailable 
(as in format v1), we should leave the file in place rather than clean it up 
eagerly.
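
   To make the two conditions concrete, here is a minimal sketch in plain Java. 
It is not the code from the PR and does not use Iceberg's API: the class, 
method, and parameter names are hypothetical, snapshots are modeled only by 
their sequence numbers on a linear history, a `null` file sequence number 
stands for "unknown" (as in format v1), and treating the adding snapshot itself 
as part of the "between" range is my assumption.

```java
import java.util.List;

// Hypothetical illustration only; none of these names exist in Iceberg.
final class DeleteEntryCleanupSketch {

    /**
     * Decides whether a file referenced by a DELETE entry of an expired
     * snapshot can be removed.
     *
     * @param expiredSeq    sequence number of the expired snapshot
     * @param fileSeq       file sequence number of the DELETE entry,
     *                      or null when it is not available (format v1)
     * @param remainingSeqs sequence numbers of the snapshots that remain
     */
    static boolean canCleanDeletedFile(long expiredSeq, Long fileSeq, List<Long> remainingSeqs) {
        // Condition 1: no remaining snapshot is older than the expired one.
        boolean noOlderSnapshot = remainingSeqs.stream().noneMatch(s -> s < expiredSeq);
        if (noOlderSnapshot) {
            return true;
        }

        // Without the file sequence number we cannot tell which snapshot
        // added the file, so keep the file instead of cleaning it eagerly.
        if (fileSeq == null) {
            return false;
        }

        // Condition 2: no remaining snapshot lies between the snapshot that
        // added the file (assumed inclusive) and the expired snapshot.
        return remainingSeqs.stream().noneMatch(s -> s >= fileSeq && s < expiredSeq);
    }
}
```

   For example, with an expired snapshot at sequence number 5 and a file added 
at sequence number 3, a remaining snapshot at 4 blocks the cleanup, while 
remaining snapshots at 2 and 6 do not.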
   
   The fix PR is here: #10983. If I have misunderstood anything, please let 
me know. Any information would be greatly appreciated!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

