pvary commented on PR #8803:
URL: https://github.com/apache/iceberg/pull/8803#issuecomment-1768023880

   > @pvary I think we probably want to push the `copyStatsForColumns` down to 
ManifestReader. 
https://github.com/apache/iceberg/blob/main/core/src/main/java/org/apache/iceberg/ManifestReader.java#L299
   
   That is for reading the data from the manifest file.
   If we want at statistics for at least one column, then the manifest file 
reading schema should contain the stat fields, like:
   ```
     private static final Set<String> STATS_COLUMNS =
         ImmutableSet.of(
             "value_counts",
             "null_value_counts",
             "nan_value_counts",
             "lower_bounds",
             "upper_bounds",
             "record_count");
   ```
   
   So we can not do filtering here. We need to read the stat fields from the 
manifest file, and then filter later for columns where we do not need it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to