Gerrrr commented on issue #8719: URL: https://github.com/apache/iceberg/issues/8719#issuecomment-1751842819
Turning off the upsert mode also leads to incorrect results: ``` Flink SQL> SELECT * FROM word_count LIMIT 10; +----+--------------------------------+----------------------+ | op | word | cnt | +----+--------------------------------+----------------------+ | +I | d | 36782 | | +I | d | 36783 | | +I | 8 | 36986 | | +I | 8 | 36987 | | +I | 7 | 36870 | | +I | 7 | 36871 | | +I | 5 | 37149 | | +I | 5 | 37150 | | +I | 0 | 37029 | | +I | 0 | 37030 | +----+--------------------------------+----------------------+ ``` For completeness, this result is incorrect because `INSERT INTO word_count SELECT word, COUNT(*) FROM word_table GROUP BY word;` contains deletions for previous values. For example: ``` +----+--------------------------------+----------------------+ | op | word | cnt | +----+--------------------------------+----------------------+ | +I | d | 36782 | | -D | d | 36782 | | +I | d | 36783 | ``` should be treated as: 1. add `d=36782` 2. delete `d=36782` 3. add `d=36783` So the query result should contain only `d=36783`. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org