hguercan commented on issue #13763: URL: https://github.com/apache/iceberg/issues/13763#issuecomment-3210443481
Thanks to @yogevyuval, who drew our attention to the high number of "added-data-files" in the commits where the duplicate file path references occur. We compared these with earlier commits, and the number is extremely high. For the other commits it is mostly two to three digits and occasionally a few thousand, but never as large as "46015" or "236825". The "added-records" and "added-files-size" values are correspondingly high. We are not sure what could explain such large numbers. The operations we see before the failing, duplicate-containing commits were maintenance jobs (operation type "replace").
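For anyone who wants to run the same check, here is a minimal sketch of how such outliers could be flagged from snapshot summaries. It assumes the summaries have already been exported as plain dicts (for example from the table's snapshots metadata). The function name `flag_suspicious_snapshots`, the `factor` threshold, the snapshot IDs, and the smaller sample counts are hypothetical; only "46015" and "236825" come from our table.

```python
from statistics import median


def flag_suspicious_snapshots(snapshots, factor=10):
    """Return snapshots whose added-data-files count exceeds factor * median.

    `snapshots` is a list of dicts with a "summary" dict; summary values are
    strings, as they are in Iceberg snapshot summaries.
    """
    counts = [int(s["summary"].get("added-data-files", 0)) for s in snapshots]
    if not counts:
        return []
    # Use the median as a baseline so a few huge commits do not skew it.
    baseline = max(median(counts), 1)
    return [s for s, n in zip(snapshots, counts) if n > factor * baseline]


# Hypothetical sample: IDs and the three smaller counts are made up.
snapshots = [
    {"snapshot_id": 1, "summary": {"added-data-files": "120"}},
    {"snapshot_id": 2, "summary": {"added-data-files": "87"}},
    {"snapshot_id": 3, "summary": {"added-data-files": "46015"}},
    {"snapshot_id": 4, "summary": {"added-data-files": "2300"}},
    {"snapshot_id": 5, "summary": {"added-data-files": "236825"}},
]

for snap in flag_suspicious_snapshots(snapshots):
    print(snap["snapshot_id"], snap["summary"]["added-data-files"])
```

The median is used instead of the mean so that the occasional commit with a few thousand files does not hide the much larger ones. The same comparison could be repeated for "added-records" and "added-files-size".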
