geruh opened a new pull request, #14081:
URL: https://github.com/apache/iceberg/pull/14081

   Fixes #14071
   
   Variant types were not handled correctly in Parquet row group filtering. 
`ParquetMetricsRowGroupFilter.notNull()` needs to account for variant types, 
which require post-scan evaluation to access shredded statistics for nested 
field filtering, similar to how structs are handled.
   
   With this change, `notNull()` always returns `ROWS_MIGHT_MATCH` for variant types.
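
   As a rough illustration of the behavior described above (not the actual Iceberg code), the following sketch models a `notNull` check that declines to prune a row group for nested types. The class `VariantNotNullSketch`, the `Kind` enum, and the `valueCount`/`nullCount` parameters are hypothetical stand-ins, and representing `ROWS_MIGHT_MATCH` and `ROWS_CANNOT_MATCH` as booleans is an assumption made for the example.

```java
// Simplified, hypothetical model of a row-group notNull check.
// This is NOT the Iceberg implementation; all names here are illustrative.
final class VariantNotNullSketch {
  // Assumed boolean encoding of the two filter outcomes.
  static final boolean ROWS_MIGHT_MATCH = true;
  static final boolean ROWS_CANNOT_MATCH = false;

  // Hypothetical stand-in for the category of the column's type.
  enum Kind { PRIMITIVE, STRUCT, VARIANT }

  // Decides whether a row group may contain a non-null value for a column.
  static boolean notNull(Kind kind, long valueCount, long nullCount) {
    // Structs and variants are evaluated after the scan, so column-level
    // null counts must not be used to skip the row group.
    if (kind == Kind.STRUCT || kind == Kind.VARIANT) {
      return ROWS_MIGHT_MATCH;
    }
    // For a primitive column, an all-null row group cannot satisfy notNull.
    if (valueCount == nullCount) {
      return ROWS_CANNOT_MATCH;
    }
    return ROWS_MIGHT_MATCH;
  }

  public static void main(String[] args) {
    // An all-null variant column is still kept for post-scan evaluation.
    System.out.println(notNull(Kind.VARIANT, 10L, 10L));
    // An all-null primitive column can be pruned.
    System.out.println(notNull(Kind.PRIMITIVE, 10L, 10L));
  }
}
```

   The point of the sketch is only the ordering: the type check comes first, so the null-count shortcut never applies to variant columns.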
   
   ### Testing 
     - testVariantFilterNotNull(): test with mixed variant/null data
     - testAllNullsVariantNotNull(): edge case with all-null variant columns
     - TestInclusiveMetricsEvaluatorWithExtract.java already provides extensive 
coverage of variant extract expressions
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]