slfan1989 commented on PR #12779: URL: https://github.com/apache/iceberg/pull/12779#issuecomment-2814624456
@sririshindra Thank you very much for your message! The information you provided is very detailed and insightful. From my personal understanding, I believe the original purpose of the `Snapshot` is to ensure that the data in the source table remains unchanged, meaning that no data or information in the source table is modified. Although writing metadata into the source table directory may not immediately cause noticeable issues, from a design perspective, this still constitutes a modification. As a user, I'd like to share some of my thoughts: I am currently working on converting Hive tables to Iceberg tables. During this process, I do want the source table to remain unchanged in order to ensure data consistency and facilitate subsequent rollback operations. After all, the source table usually holds critical business data, and we rely on it as a stable foundation. Therefore, I chose to use the Snapshot process for the conversion to ensure that the data in the source table is preserved to the greatest extent during the migration to the Iceberg table. Regarding the other details you raised, I will carefully review the code and provide further analysis and feedback. Your detailed analysis has left a strong impression on me. Thank you again for your valuable insights! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org