manuzhang commented on issue #12288:
URL: https://github.com/apache/iceberg/issues/12288#issuecomment-2662370812

   Here's a solution (using Spark SQL as an example).
   
   1. Find added position delete files from the `entries` metadata table of a 
`snapshot_id`.
   
   ```sql
   select data_file.file_path from csdw_t.F_CS_CNVRSTN_SEGMENT.entries where 
snapshot_id='$snapshot_id' and data_file.content=1;
   ```
   
   2. Collect affected data files from content position delete files, whose 
columns are data_file_path and position.
   
   ```sql
   select * from parquet.`$position_delete_file.parquet`
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to