zhongyujiang opened a new pull request, #6967: URL: https://github.com/apache/iceberg/pull/6967
This implements a `ParquetColumnIndexFilter` which is equivalent to the Parquet-mr's `ColumnIndexFilter`, but based on Iceberg's expressions. Currently, some of Parquet methods needed in the PR are not accessible, so this uses the reflection method shared in #6935 to get the necessary information. This also applys`ParquetColumnIndexFilter` to row read path and updates the row read path to handle `RowRanges` generated by `ParquetColumnIndexFilter` to support page skipping. The batch read path will be updated in a subsequent PR. Coauthored by @rdblue -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
