singhpk234 commented on issue #10930: URL: https://github.com/apache/iceberg/issues/10930#issuecomment-2295114217
> which would specify if bounds are exact for a particular data file. This would allow to reliably perform aggregation pushdown to Iceberg by engines. By default lower_bounds and upper_bounds should be treated as non-exact by engines. +1, IMHO its not possible to track with what metric configuration a particular data file was created, having a dedicated field would be great though optional drops a ball when different writers are concerned, but I think lower_bound and upper_bound are optional in first place :) so we should be good in that aspect ! > how much real-world improvement it would bring, cause it's not like the current file-level metadata is useless for aggregation queries speedup Looking forward to it in spark atleast we replace the scan to Local Scan so actually don't read any data file, not sure how trino implements it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org