huaxingao opened a new pull request, #11551: URL: https://github.com/apache/iceberg/pull/11551
In Equality Delete, we build `ColumnarBatchReader` for the equality delete filter columns to read their values and determine which rows are deleted. If these filter columns are not among the requested columns, they are considered extra and should be removed before returning the `ColumnBatch` to Spark. Suppose the table schema includes C1, C2, C3, C4, C5. If the query is: `SELECT C5 FROM table`, and the equality delete filter is on C3 and C4, We read the values of C3 and C4 to identify which rows are deleted. However, we do not want to include these values in the `ColumnBatch` that we return to Spark. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org