liurenjie1024 commented on issue #1604: URL: https://github.com/apache/iceberg-rust/issues/1604#issuecomment-3200072313
Thanks @ZENOTME for raising this, but I have concern to do the filter in task planning phase, as opening data files is typically slow. There are two places to improve: 1. Size base planning, see https://github.com/apache/iceberg-rust/issues/128 2. Current data fusion table scan has only one partition: https://github.com/apache/iceberg-rust/blob/6e21b7fcfff2819e7d764b17eff066ed3424ccd5/crates/integrations/datafusion/src/physical_plan/scan.rs#L105 3. Filter push down not working: https://github.com/apache/iceberg-rust/blob/36cc12087bc118f1fe10efa6b30db98bd9655ba7/crates/integrations/datafusion/src/table/mod.rs#L152 As with row group pruning, it should be pushed to parquet reader. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
