pvary commented on issue #9299: URL: https://github.com/apache/iceberg/issues/9299#issuecomment-1857355625
Do I understand correctly, that you have 2 different jobs writing to the same table in update mode? I think this situation should be avoided. Flink update mode deletes the rows like this: - If the row is added in a previous checkpoint, it adds an equality delete row - every row with a given primary key before this checkpoint should be deleted - If the row is added in this checkpoint, it adds a position delete row for the last inserted row. This is designed for a single writer case, but could work for multiple writers too, with one serious caveat. If you have multiple writers, then you never know which one will checkpoint first. The second checkpoint will overwrite the results of the first one. So in the end the **table contents are defined by the Iceberg commit time**, and not the time when the actual update happens. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org