pvary commented on issue #9299:
URL: https://github.com/apache/iceberg/issues/9299#issuecomment-1857355625

   Do I understand correctly, that you have 2 different jobs writing to the 
same table in update mode?
   
   I think this situation should be avoided.
   
   Flink update mode deletes the rows like this:
   - If the row is added in a previous checkpoint, it adds an equality delete 
row - every row with a given primary key before this checkpoint should be 
deleted
   - If the row is added in this checkpoint, it adds a position delete row for 
the last inserted row.
   
   This is designed for a single writer case, but could work for multiple 
writers too, with one serious caveat. If you have multiple writers, then you 
never know which one will checkpoint first. The second checkpoint will 
overwrite the results of the first one.
   
   So in the end the **table contents are defined by the Iceberg commit time**, 
and not the time when the actual update happens.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to