paul-bormans-pcgw commented on issue #11695:
URL: https://github.com/apache/iceberg/issues/11695#issuecomment-2548089198

   I did another test run with more debug info but it doesn't make sense to me; 
it seems when a snapshot expires which has operation="delete", the 
findFilesToDelete doesn't find any entries inside the manifest with status 
DELETED?
   ```
   SplitRunner-20241216_162336_00244_5v5h5.0.0.0-1-2254 
org.apache.iceberg.IncrementalFileCleanup       Expired snapshot: 
BaseSnapshot{id=3231194833683786375, timestamp_ms=1734355116274, 
operation=delete, summary={trino_query_id=20241216_131810_00097_5v5h5, 
added-position-delete-files=94, added-delete-files=94, 
added-files-size=14250935, added-position-deletes=10695387, 
changed-partition-count=1, total-records=66602999, 
total-files-size=10931827904, total-data-files=573, total-delete-files=94, 
total-position-deletes=10695387, total-equality-deletes=0, 
iceberg-version=Apache Iceberg 1.6.1 (commit 
8e9d59d299be42b0bca9461457cd1e95dbaad086)}, 
manifest-list=s3://demobucket/ts.db/pack/metadata/snap-3231194833683786375-2-07d2a425-cbbd-45f2-b63b-7411726a1e75.avro,
 schema-id=1}
   <...>
   org.apache.iceberg.IncrementalFileCleanup    Evaluate manifests to find 
files to delete; manifestsToScan: [
   GenericManifestFile{content=DATA, 
path=s3://demobucket/ts.db/pack/metadata/07d2a425-cbbd-45f2-b63b-7411726a1e75-m2.avro,
 length=492480, partition_spec_id=1, added_snapshot_id=3231194833683786375, 
added_data_files_count=null, added_rows_count=null, 
existing_data_files_count=null, existing_rows_count=null, 
deleted_data_files_count=null, deleted_rows_count=null, partitions=null, 
key_metadata=null, sequence_number=0, min_sequence_number=0}, 
   
   SplitRunner-20241216_162336_00244_5v5h5.0.0.0-1-2254 
org.apache.iceberg.IncrementalFileCleanup       Files to delete: []
   ```
   
   What am i missing here? Why doesn't a DELETE query on Trino result in 
Manifest entries with status DELETED?
   
   Paul
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to