dramaticlly opened a new pull request, #9865: URL: https://github.com/apache/iceberg/pull/9865
Based on comments from https://github.com/apache/iceberg/pull/5215/files#r915475413 and https://github.com/apache/iceberg/pull/5215/files#r915276052

The default limit of 100 counts only top-level columns. Stats are still collected for the nested children of those top-level columns, but the children do not count toward the 100.

Example:
```
root
 |-- foo1: struct (nullable = true)
 |    |-- bar1: integer (nullable = false)
 |    |-- baz1: string (nullable = true)
 |-- foo2: struct (nullable = true)
 |    |-- bar2: integer (nullable = false)
 |    |-- baz2: string (nullable = true)
 ...
 |-- foo101: struct (nullable = true)
 |    |-- bar101: integer (nullable = true)
 |    |-- baz101: string (nullable = true)
```
- stats for `foo1.bar1`, `foo1.baz1`, `foo2.bar2`, `foo2.baz2` will all be collected using the default mode (`truncate(16)`)
- stats for `foo101.bar101`, `foo101.baz101` will be `none`

CC @stevenzwu
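To make the counting rule concrete, here is a minimal Python sketch of the behavior described above. It is not Iceberg's implementation: the function `infer_metrics_modes`, the dict-based schema, and the constant names are hypothetical and exist only for illustration. It assumes the limit is applied by top-level column position and that every nested child inherits the mode of its top-level parent.

```python
from typing import Dict, List

# Mode names taken from the description above; the constants are illustrative.
DEFAULT_MODE = "truncate(16)"
NO_STATS = "none"


def infer_metrics_modes(schema: Dict[str, List[str]], limit: int = 100) -> Dict[str, str]:
    """Hypothetical helper: map each nested leaf column to a metrics mode.

    `schema` maps a top-level struct name to the names of its children,
    in schema order. Only top-level columns count toward `limit`.
    """
    modes: Dict[str, str] = {}
    for position, (parent, children) in enumerate(schema.items()):
        # The limit is checked once per top-level column, not per child.
        mode = DEFAULT_MODE if position < limit else NO_STATS
        for child in children:
            modes[f"{parent}.{child}"] = mode
    return modes


# 101 top-level structs (foo1 .. foo101), each with two children.
schema = {f"foo{i}": [f"bar{i}", f"baz{i}"] for i in range(1, 102)}
modes = infer_metrics_modes(schema)

collected = [name for name, mode in modes.items() if mode == DEFAULT_MODE]
print(len(collected))          # leaf columns with stats collected
print(modes["foo101.bar101"])  # mode for a child of the 101st top-level column
```

In this sketch, 200 leaf columns get stats even though the limit is 100, because the limit is consumed by the 100 top-level structs rather than by their children.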