Gerrrr commented on issue #8719:
URL: https://github.com/apache/iceberg/issues/8719#issuecomment-1751842819

   Turning off the upsert mode also leads to incorrect results:
   
   ```
   Flink SQL> SELECT * FROM word_count LIMIT 10;
   +----+--------------------------------+----------------------+
   | op |                           word |                  cnt |
   +----+--------------------------------+----------------------+
   | +I |                              d |                36782 |
   | +I |                              d |                36783 |
   | +I |                              8 |                36986 |
   | +I |                              8 |                36987 |
   | +I |                              7 |                36870 |
   | +I |                              7 |                36871 |
   | +I |                              5 |                37149 |
   | +I |                              5 |                37150 |
   | +I |                              0 |                37029 |
   | +I |                              0 |                37030 |
   +----+--------------------------------+----------------------+
   ```
   
   For completeness, this result is incorrect because `INSERT INTO word_count 
SELECT word, COUNT(*) FROM word_table GROUP BY word;` contains deletions for 
previous values. For example:
   
   ```
   +----+--------------------------------+----------------------+
   | op |                           word |                  cnt |
   +----+--------------------------------+----------------------+
   | +I |                              d |                36782 |
   | -D |                              d |                36782 |
   | +I |                              d |                36783 |
   ```
   
   should be treated as:
   1. add `d=36782`
   2. delete `d=36782`
   3. add `d=36783`
   
   So the query result should contain only `d=36783`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to