manuzhang commented on issue #12288: URL: https://github.com/apache/iceberg/issues/12288#issuecomment-2662370812
Here's a solution (using Spark SQL as an example). 1. Find added position delete files from the `entries` metadata table of a `snapshot_id`. ```sql select data_file.file_path from csdw_t.F_CS_CNVRSTN_SEGMENT.entries where snapshot_id='$snapshot_id' and data_file.content=1; ``` 2. Collect affected data files from content position delete files, whose columns are data_file_path and position. ```sql select * from parquet.`$position_delete_file.parquet` ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org