pvary commented on code in PR #12230:
URL: https://github.com/apache/iceberg/pull/12230#discussion_r1952054468
##########
format/spec.md:
##########
@@ -392,7 +392,7 @@ In v3 and later, an Iceberg table can track row lineage fields for all newly cre
 These fields are assigned and updated by inheritance because the commit sequence number and starting row ID are not assigned until the snapshot is successfully committed. Inheritance is used to allow writing data and manifest files before values are known so that it is not necessary to rewrite data and manifest files when an optimistic commit is retried.
-When row lineage is enabled, new snapshots cannot include [Equality Deletes](#equality-delete-files). Row lineage is incompatible with equality deletes because lineage values must be maintained, but equality deletes are used to avoid reading existing data before writing changes.
+Row lineage does not track updates for rows updated because of [Equality Deletes](#equality-delete-files). Rows updated via an equality delete are always treated as if the row was completely removed and a unique new row was created.

Review Comment:
   Maybe this is too technical, but equality deletes themselves have nothing to do with row lineage. The engines are to blame: they don't populate the old rowId in the new version of the row.