rambleraptor opened a new pull request, #3361:
URL: https://github.com/apache/iceberg-python/pull/3361

   <!--
   Thanks for opening a pull request!
   -->
   
   <!-- In the case this PR will resolve an issue, please replace 
${GITHUB_ISSUE_ID} below with the actual Github issue id. -->
   <!-- Closes #${GITHUB_ISSUE_ID} -->
   
   # Rationale for this change
   This adds support for the RemoveOrphanFiles metadata maintenance task. The 
goal is to match the Java implementation.
   
   I had to add a list method to FileIO in order to fully implement this. I can 
separate that work into a separate PR if that's more useful.
   
   A good follow-up would be to wire this into the CLI. Doing these ad-hoc 
actions without having to write a script / spin up a Spark cluster is a huge 
win!
   
   ## Are these changes tested?
   I did some local testing where I took a table with orphaned files and tried 
both the Java/PyIceberg implementations against it. Results were the same.
   
   There's also plenty of tests.
   
   ## Are there any user-facing changes?
   - Adds support for the RemoveOrphanFiles maintenance action
   
   <!-- In the case of user-facing changes, please add the changelog label. -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to