rdblue commented on code in PR #7105: URL: https://github.com/apache/iceberg/pull/7105#discussion_r1209509285
########## format/spec.md: ########## @@ -702,6 +703,21 @@ Blob metadata is a struct with the following fields: | _optional_ | _optional_ | **`properties`** | `map<string, string>` | Additional properties associated with the statistic. Subset of Blob properties in the Puffin file. | +#### Partition statistics + +Partition statistics files are the valid files based on [Partition statistics spec](../partition-statistics-spec). Partition statistics are informational. A reader can choose to +ignore partition statistics information. Partition statistics support is not required to read the table correctly. A table can contain +many partition statistics files associated with different table snapshots. Review Comment: It's fine to say that partition stats files are informational and it's okay not to read them. But what we really need is a statement of the specific _requirements_ for partition stats files. When producing them, what must be true for them to be reliable information for engines? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org