dramaticlly commented on PR #11045: URL: https://github.com/apache/iceberg/pull/11045#issuecomment-2329762019
Had some offline discussion with @flyrain and decided to update documentation instead of the original code change, given the [row lineage proposal in v3](https://docs.google.com/document/d/146YuAnU17prnIhyuvbCtCtVSavyd5N7hKryyVRaFDTE/edit#heading=h.8vlydu9k3h0e) looks more promising as it will add a marker to every row in an Iceberg table to indicate its origin and it's better suited for CDC usage here. The current approach rely on partitioning and sorting to identify the carry over rows cannot afford to skip unorderable columns and keep them in memory to identify the carry over rows in iterators. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org