slfan1989 opened a new pull request, #669:
URL: https://github.com/apache/iceberg-cpp/pull/669

   ## Summary
   
   This PR adds parallel file deletion support to `ExpireSnapshots` cleanup.
   
   The change is scoped to the internal cleanup path only:
   - Keeps the public `ExpireSnapshots` API unchanged.
   - Uses an internal default delete parallelism.
   - Splits cleanup files into chunks and deletes them concurrently.
   - Preserves the existing best-effort cleanup behavior.
   - Keeps `DeleteWith(...)` supported for custom deletion logic.
   
   ## Motivation
   
   `ExpireSnapshots` may need to delete many files during cleanup, including 
data files, manifest files, manifest list files, and statistics files. Deleting 
these files sequentially can slow down snapshot expiration for large tables.
   
   Parallelizing the cleanup deletion path improves throughput without changing 
the default behavior of `FileIO::DeleteFiles`.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to