pvary commented on issue #10892:
URL: https://github.com/apache/iceberg/issues/10892#issuecomment-2276410555

   For the 3rd point:
   - Flink uses two phase commits. In the 1st phase the data is written to a 
temp manifest file, and the file path is stored into the state.
   
   So if there is a failure between the 2 commit phases, it could happen that 
the data is available in the temp manifest file, but not yet committed to the 
Iceberg table. This should be considered before throwing an error.
   
   But if the current snapshot of the Iceberg table is newer than the 
checkpoint we restore from, then it would be fine to revert to the given 
snapshot - we need to throw an exception if there was any concurrent writes to 
the table in the meantime (some other writers might written data independently).
   
   Also we should examine what we can do with the new IcebergSink. @rodmeneses 
could you please chime in?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to