[I] Spark's optimizer miss-merges Iceberg's two different refs because their canonicalized forms are same. [iceberg]

via GitHub Tue, 09 Jan 2024 05:14:04 -0800


wooyeong opened a new issue, #9450:
URL: https://github.com/apache/iceberg/issues/9450


   ### Apache Iceberg version
   
   1.4.2 (latest release)
   
   ### Query engine
   
   Spark
   
   ### Please describe the bug 🐞
   
   I was trying to make a history table using the time travel feature.
   For example, I could only insert one row for the date of 2024-01-02 thanks 
to time travel.
   
   user|date
   --|--
   a | 2024-01-01
   b | 2024-01-01
   a | 2024-01-02
   b | 2024-01-02
   c | 2024-01-02
   
   However, I encountered a weird bug that blocked me from making a complete 
set of historical data.
   I'll paste reproducible queries and the possible cause below.
   
   ## Queries
   - Run on Spark 3.5.0 and Iceberg 1.4.2
   
   ```sql
   -- Create table
   CREATE TABLE iceberg_except_test (
       id string,
       a string,
       b timestamp
   )
   USING ICEBERG
   PARTITIONED BY (DATE(b));
   
   -- Insert a row: (1, a, 2023-01-01)
   MERGE INTO iceberg_except_test AS t
   USING
     (SELECT '1' AS id, 'a' AS a, date '2023-01-01' AS b) AS s
   ON t.id = s.id
   WHEN MATCHED THEN UPDATE SET t.a = s.a, t.b = s.b
   WHEN NOT MATCHED THEN INSERT *;
   
   -- Set tag `2023-01-01` -> [(1, a, 2023-01-01)]
   ALTER TABLE iceberg_except_test CREATE OR REPLACE TAG `2023-01-01`;
   
   -- Update the row: (1, b, 2024-01-01)
   MERGE INTO iceberg_except_test AS t
   USING
     (SELECT '1' AS id, 'b' AS a, date '2024-01-01' AS b) AS s
   ON t.id = s.id
   WHEN MATCHED THEN UPDATE set t.a = s.a, t.b = s.b
   WHEN NOT MATCHED THEN INSERT *;
   
   -- Set tag `2024-01-01` -> [(1, b, 2024-01-01)]
   ALTER TABLE iceberg_except_test CREATE OR REPLACE TAG `2024-01-01`;
   
   -- Fetch two tags altogether -> [(1, a, 2023-01-01), (1, b, 2024-01-01)]
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2023-01-01'
   
   UNION ALL 
   
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2024-01-01';
   
   -- Fetch distinct rows from two tags -> [(1, a, 2023-01-01)] * It should 
return two rows.
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2023-01-01'
   
   UNION
   
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2024-01-01';
   ```
   
   ## Query plan
   - In the optimization phase, two sub-queries merged into one.
   
   ```
   > == Parsed Logical Plan ==
   > 'Distinct
   > +- 'Union false, false
   >    :- 'Project [*]
   >    :  +- 'RelationTimeTravel 'UnresolvedRelation [iceberg_except_test], 
[], false, 2023-01-01
   >    +- 'Project [*]
   >       +- 'RelationTimeTravel 'UnresolvedRelation [iceberg_except_test], 
[], false, 2024-01-01
   > 
   > == Analyzed Logical Plan ==
   > id: string, a: string, b: timestamp
   > Distinct
   > +- Union false, false
   >    :- Project [id#189, a#190, b#191]
   >    :  +- SubqueryAlias local.iceberg_except_test
   >    :     +- RelationV2[id#189, a#190, b#191] local.iceberg_except_test 
local.iceberg_except_test
   >    +- Project [id#192, a#193, b#194]
   >       +- SubqueryAlias local.iceberg_except_test
   >          +- RelationV2[id#192, a#193, b#194] local.iceberg_except_test 
local.iceberg_except_test
   > 
   > == Optimized Logical Plan ==
   > Aggregate [id#189, a#190, b#191], [id#189, a#190, b#191]
   > +- RelationV2[id#189, a#190, b#191] local.iceberg_except_test
   > 
   > == Physical Plan ==
   > AdaptiveSparkPlan isFinalPlan=false
   > +- HashAggregate(keys=[id#189, a#190, b#191], functions=[], 
output=[id#189, a#190, b#191])
   >    +- Exchange hashpartitioning(id#189, a#190, b#191, 200), 
ENSURE_REQUIREMENTS, [plan_id=574]
   >       +- HashAggregate(keys=[id#189, a#190, b#191], functions=[], 
output=[id#189, a#190, b#191])
   >          +- BatchScan local.iceberg_except_test[id#189, a#190, b#191] 
local.iceberg_except_test (branch=null) [filters=, groupedBy=] RuntimeFilters: 
[]
   ```
   
   
   ## Cause
   - I found out this false optimization happens in 
[RemoveNoopUnion](https://github.com/apache/spark/blob/b888ea877230976e15b251310e3840e3f73f904f/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala#L637)
     - It seems as two sub-queries have same 
[canonicalized](https://github.com/apache/spark/blob/a2d5c9c55d2adea4a914cc647e1a100b8b8cee5d/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala#L555),
 they are accidentally merged into one.
   - When I disabled the rule, the same query generated both rows.
   
   ```sql
   --conf 
spark.sql.optimizer.excludedRules=org.apache.spark.sql.catalyst.optimizer.RemoveNoopUnion
   -- Fetch distinct rows from two tags -> [(1, a, 2023-01-01), (1, b, 
2024-01-01)]
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2023-01-01'
   
   UNION
   
   SELECT * FROM iceberg_except_test
   VERSION AS OF '2024-01-01';
   ```
   
   ## Possible fix
   - I confirmed that the query works after changing to 
[SparkTable#equals](https://github.com/apache/iceberg/blob/afe4aec4db795f4829757d738a5bbcf1b7db8fd2/spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkTable.java#L401C1-L411)
 to compare 
[branch](https://github.com/apache/iceberg/blob/afe4aec4db795f4829757d738a5bbcf1b7db8fd2/spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkTable.java#L123)
 and 
[snapshotId](https://github.com/apache/iceberg/blob/afe4aec4db795f4829757d738a5bbcf1b7db8fd2/spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkTable.java#L120)
 as well as name, to have unique canonical form for each ref.
     - I built a jar locally and ran a test.
   - I'm not sure if it is the correct way to do so.
   
   
   You could save me huge storage costs if I can utilize this feature. I look 
forward to your advice. Thanks in advance.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

[I] Spark's optimizer miss-merges Iceberg's two different refs because their canonicalized forms are same. [iceberg]

Reply via email to