vburenin opened a new issue, #12477:
URL: https://github.com/apache/iceberg/issues/12477

   ### Feature Request / Improvement
   
   I recently run into an issue where Iceberg tables exploded with the large 
number of delete files. While getting rid of them is not a problem, the problem 
is how long it takes, probably due to the sequential way of compaction.
   
   
   This literally takes 10-20 minutes per partition and should be optimized. 
Tested with Spark 3.4.2 and Iceberg 1.8.1.
   
   <img width="535" alt="Image" 
src="https://github.com/user-attachments/assets/99722d2e-767d-4fac-8094-a728e50fb24f";
 />
   
   ### Query engine
   
   Spark
   
   ### Willingness to contribute
   
   - [ ] I can contribute this improvement/feature independently
   - [ ] I would be willing to contribute this improvement/feature with 
guidance from the Iceberg community
   - [x] I cannot contribute this improvement/feature at this time


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to