Re: [I] Specify in lower/upper bounds in data_file struct are exact [iceberg]

via GitHub Sat, 17 Aug 2024 21:57:25 -0700


singhpk234 commented on issue #10930:
URL: https://github.com/apache/iceberg/issues/10930#issuecomment-2295114217


   > which would specify if bounds are exact for a particular data file. This 
would allow to reliably perform aggregation pushdown to Iceberg by engines. By 
default lower_bounds and upper_bounds should be treated as non-exact by engines.
   
   +1, IMHO its not possible to track with what metric configuration a 
particular data file was created, having a dedicated field would be great 
though optional drops a ball when different writers are concerned, but I think 
lower_bound and upper_bound are optional in first place :) so we should be good 
in that aspect !
   
   > how much real-world improvement it would bring, cause it's not like the 
current file-level metadata is useless for aggregation queries speedup
   
   Looking forward to it in spark atleast we replace the scan to Local Scan so 
actually don't read any data file, not sure how trino implements it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [I] Specify in lower/upper bounds in data_file struct are exact [iceberg]

Reply via email to