sdd commented on issue #630:
URL: https://github.com/apache/iceberg-rust/issues/630#issuecomment-2357605851

   I'm happy to add the partitioning result to the task. This is useful to the 
executor node when deciding how to distribute tasks, as it enables the use of a 
few different strategies, the choice of which can be left to the implementer.
   
   It is not necessarily the case that the delete file is read repeatedly if 
the delete file list is added to the file scan task, since we can store the 
parsed delete files inside the object cache, preventing them from being read 
repeatedly on the same node as they'd already be in memory. If the executor 
ensures that all tasks with the same partition get sent to the same executor, 
then the files would only be read once.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to