gaborkaszab commented on issue #6257:
URL: https://github.com/apache/iceberg/issues/6257#issuecomment-1398396353

   > What would the algorithm be? If the partition has delete files, try to do 
a full MOR, and check if records are null? Personally, sounds a bit extreme, I 
would think a good first step is just add a column for delete_files (It may be 
easier after my new change in #6365). After all, we do have a partition 
existing, just its of invalid delete files. Interested to hear others thoughts 
as well.
   
   Well, giving this a second (and a third) thought I have to admit that 
applying delete file on the data files to get the partitions is too 
heavyweight. I'm wondering if we should document this behaviour somewhere as I 
remember on Slack there was someone confused about the 'record_count' column of 
the metadata table not adding up to the same value what count(*) gives.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to