a-agmon commented on issue #265: URL: https://github.com/apache/iceberg-rust/issues/265#issuecomment-2012751222
Hi @viirya, perhaps a bit off-topic, but I'm wondering what you think. I have been testing this a bit, and while I have always seen performance improvements from using `ParquetRecordBatchStream` over `ParquetRecordBatchReader`, the benefit of using `RowFilter` really depended on the predicate and the data. Sometimes it even had a negative impact on performance (even compared to the non-async reader). I think that was the case when filtering for very "common" values, where the predicate matches a large share of the rows. Is there any conventional wisdom regarding when it shouldn't be used?
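To make the "common values" observation concrete, here is a back-of-the-envelope cost model. It is only a sketch, not a measurement. It assumes that a row filter decodes the predicate column for every row first, and then decodes the projected columns (including the predicate column again) only for the rows that pass. The `Costs` struct, both functions, and all constants are hypothetical and exist purely for illustration; none of them come from the `parquet` crate.

```rust
/// Hypothetical per-row costs in arbitrary units. These are assumptions
/// chosen for illustration, not measurements of any real reader.
struct Costs {
    decode_predicate_col: f64, // decoding one value of the predicate column
    decode_other_cols: f64,    // decoding one row of the remaining projected columns
    selection_overhead: f64,   // per-row bookkeeping for building/applying a selection
}

/// Decode every projected column for every row, then filter afterwards.
fn cost_without_row_filter(rows: f64, c: &Costs) -> f64 {
    rows * (c.decode_predicate_col + c.decode_other_cols)
}

/// Decode the predicate column for all rows, build a selection, then decode
/// the projected columns (predicate column included) for surviving rows only.
/// `selectivity` is the fraction of rows that pass the predicate (0.0..=1.0).
fn cost_with_row_filter(rows: f64, selectivity: f64, c: &Costs) -> f64 {
    let kept = rows * selectivity;
    rows * (c.decode_predicate_col + c.selection_overhead)
        + kept * (c.decode_predicate_col + c.decode_other_cols)
}

fn main() {
    let c = Costs {
        decode_predicate_col: 1.0,
        decode_other_cols: 4.0,
        selection_overhead: 0.5,
    };
    let rows = 1_000_000.0;
    let plain = cost_without_row_filter(rows, &c);
    for selectivity in [0.01, 0.5, 0.9] {
        let filtered = cost_with_row_filter(rows, selectivity, &c);
        println!("selectivity {selectivity:.2}: plain {plain:.0}, row filter {filtered:.0}");
    }
}
```

Under these made-up constants the row filter wins clearly when few rows match and loses once most rows match, because the first pass over the predicate column becomes pure overhead. The real break-even point depends on the encoding, page layout, and I/O pattern, so it would have to be measured per workload.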