hantangwangd commented on issue #10982: URL: https://github.com/apache/iceberg/issues/10982#issuecomment-2301489759
As I understand, we should not directly clean a file in a DELETE entry of the expired snapshot, but can only clean it safely when one of the following conditions are met: - There are no Snapshots older than this expired one - There are no Snapshots exist between the one that added it and this one We can use the DELETE entry's file sequence number to figure out the snapshot which added this file to the table. If unable to obtain this message (as in format v1), we should leave the file as it is rather than clean it eagerly. The fix PR is here: #10983 . If there are any misundertandings, please let me know. Any informations would be greatly appreciated! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org