xxhZs commented on issue #630:
URL: https://github.com/apache/iceberg-rust/issues/630#issuecomment-2350911899

   Hi, I recently implemented merge-on-read in my library using iceberg-rust 
and submitted a simplified working version of the code. It looks somewhat 
similar to the `A naive approach` version you proposed! (Each of my nodes 
has to read the same delete file.)
   The PR is https://github.com/apache/iceberg-rust/pull/625
   I have some doubts about this issue.
   Regarding FileScanTask {DeleteFileEntry}: as you said, delete files and 
data files are many-to-many. So even if the list of delete files is stored 
in the file task, in the optimal case the caller still needs some special 
handling to make sure that related data files and delete files are 
dispatched to the same node, and that no delete file is read more than 
once. Most likely, this scheduling result is consistent with the 
partitioning result.
   In that case, I would prefer to expose the partitioning result directly 
in the file task. Please correct me if I have misunderstood anything.
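
   To illustrate the idea, here is a minimal sketch of scheduling by 
partition. The `ScanTask` struct, its `partition` field, and the 
`plan_by_partition` function are hypothetical names made up for this 
example; they are not the iceberg-rust API. It also assumes that a delete 
file only applies to data files in the same partition, so sending a whole 
partition to one node keeps each delete file on a single node.

```rust
use std::collections::BTreeMap;

/// Hypothetical, simplified stand-in for a file scan task.
/// This is NOT the iceberg-rust `FileScanTask` type.
#[derive(Debug, Clone, PartialEq)]
struct ScanTask {
    data_file: String,
    /// Delete files that apply to `data_file`.
    delete_files: Vec<String>,
    /// Assumption: the partition value is exposed directly on the task.
    partition: String,
}

/// Group tasks by partition, then hand each partition group to a node
/// in round-robin order. All tasks of one partition land on one node,
/// so (under the assumption above) no delete file is needed on two nodes.
fn plan_by_partition(tasks: Vec<ScanTask>, num_nodes: usize) -> Vec<Vec<ScanTask>> {
    assert!(num_nodes > 0, "need at least one node");

    // BTreeMap keeps partitions in a deterministic (sorted) order.
    let mut groups: BTreeMap<String, Vec<ScanTask>> = BTreeMap::new();
    for task in tasks {
        groups.entry(task.partition.clone()).or_default().push(task);
    }

    let mut nodes = vec![Vec::new(); num_nodes];
    for (i, (_partition, group)) in groups.into_iter().enumerate() {
        nodes[i % num_nodes].extend(group);
    }
    nodes
}
```

   With the partition on the task, the caller does not need to inspect the 
delete file lists at all to co-locate files; the delete file list alone 
would require computing connected groups of data and delete files first.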
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

