Re: [I] De-Duping Rows While Compacting [iceberg]

2025-03-12 Thread via GitHub
haggy commented on issue #8702: URL: https://github.com/apache/iceberg/issues/8702#issuecomment-2719655248 @zenfenan Was there ever any progress made on this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

Re: [I] De-Duping Rows While Compacting [iceberg]

2024-09-21 Thread via GitHub
github-actions[bot] commented on issue #8702: URL: https://github.com/apache/iceberg/issues/8702#issuecomment-2365374193 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] De-Duping Rows While Compacting [iceberg]

2023-10-16 Thread via GitHub
W-I-D-EE commented on issue #8702: URL: https://github.com/apache/iceberg/issues/8702#issuecomment-1765308407 Further to this, i have actually had a lot of trouble getting delete from or merge into working with removing duplicate rows. Today the only way i have been able to remove deuplicat

Re: [I] De-Duping Rows While Compacting [iceberg]

2023-10-12 Thread via GitHub
dramaticlly commented on issue #8702: URL: https://github.com/apache/iceberg/issues/8702#issuecomment-1760455077 data compaction only change physical files layout but not the data visible to users. Consider you originally have 1000 records with 10 duplicates, after deduplication it would be