slfan1989 commented on PR #12779:
URL: https://github.com/apache/iceberg/pull/12779#issuecomment-2814624456

   @sririshindra Thank you very much for your message! The information you 
provided is detailed and insightful. As I understand it, the original purpose 
of `Snapshot` is to leave the source table unchanged, meaning that no data or 
other information in the source table is modified.
   
   Although writing metadata into the source table directory may not 
immediately cause noticeable issues, from a design perspective it still 
constitutes a modification. As a user, I'd like to share my own use case: I am 
currently converting Hive tables to Iceberg tables.
   
   During this process, I want the source table to remain unchanged so that 
the data stays consistent and a later rollback remains possible. After all, 
the source table usually holds critical business data, and we rely on it as a 
stable foundation. That is why I chose the Snapshot process for the 
conversion: it preserves the source table's data as far as possible during the 
migration to Iceberg.
   
   Regarding the other details you raised, I will carefully review the code and 
provide further analysis and feedback.
   
   Your detailed analysis has left a strong impression on me. Thank you again 
for your valuable insights!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

