ebyhr opened a new pull request, #13161:
URL: https://github.com/apache/iceberg/pull/13161

   I've been investigating the OOM issue in the Trino coordinator.
   
   It turns out that a significant portion of memory usage comes from the 
statistics stored in `DeleteFileIndex`.
   To address this, I propose that we avoid copying the stats, just as we already 
do for `DataFile` in the `ManifestGroup#createFileScanTasks` method.
   
   
https://github.com/apache/iceberg/blob/5a096b764fa88e4a71d0d53e90aaa58ecb6d6db8/core/src/main/java/org/apache/iceberg/ManifestGroup.java#L364-L365
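
   A minimal sketch of the idea, using simplified stand-in classes rather than the actual Iceberg types. `FileEntry`, `buildIndex`, and the `keepStats` flag are hypothetical names for illustration; only the `copy()` / `copyWithoutStats()` pair mirrors methods on Iceberg's `ContentFile`. The actual change in this PR may be structured differently.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class DropStatsSketch {

  // Hypothetical stand-in for a delete file entry; NOT the Iceberg DeleteFile interface.
  static final class FileEntry {
    final String path;
    final Map<Integer, Long> valueCounts; // per-column stats; null once dropped

    FileEntry(String path, Map<Integer, Long> valueCounts) {
      this.path = path;
      this.valueCounts = valueCounts;
    }

    // Full copy: duplicates the stats map, which is what makes the index heavy.
    FileEntry copy() {
      return new FileEntry(path, valueCounts == null ? null : new HashMap<>(valueCounts));
    }

    // Slim copy: keeps the file identity but retains no stats.
    FileEntry copyWithoutStats() {
      return new FileEntry(path, null);
    }
  }

  // Hypothetical index builder: copies each entry, dropping stats unless requested.
  static List<FileEntry> buildIndex(List<FileEntry> entries, boolean keepStats) {
    List<FileEntry> index = new ArrayList<>();
    for (FileEntry entry : entries) {
      index.add(keepStats ? entry.copy() : entry.copyWithoutStats());
    }
    return index;
  }

  public static void main(String[] args) {
    Map<Integer, Long> stats = new HashMap<>();
    stats.put(1, 100L);
    List<FileEntry> entries = new ArrayList<>();
    entries.add(new FileEntry("delete-1.parquet", stats));

    List<FileEntry> slim = buildIndex(entries, false);
    System.out.println("stats retained: " + (slim.get(0).valueCounts != null));
  }
}
```

   With `keepStats` set to `false`, the index holds only the slim copies, so the per-column maps are not retained by the index.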
   
   Related Slack thread: 
https://apache-iceberg.slack.com/archives/C03LG1D563F/p1747988880908899


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

