rdblue commented on code in PR #7105:
URL: https://github.com/apache/iceberg/pull/7105#discussion_r1209509285


##########
format/spec.md:
##########
@@ -702,6 +703,21 @@ Blob metadata is a struct with the following fields:
 | _optional_ | _optional_ | **`properties`** | `map<string, string>` | Additional properties associated with the statistic. Subset of Blob properties in the Puffin file. |
 
 
+#### Partition statistics
+
+Partition statistics files are the valid files based on [Partition statistics spec](../partition-statistics-spec). Partition statistics are informational. A reader can choose to
+ignore partition statistics information. Partition statistics support is not required to read the table correctly. A table can contain
+many partition statistics files associated with different table snapshots.

Review Comment:
   It's fine to say that partition stats files are informational and it's okay 
not to read them. But what we really need is a statement of the specific 
_requirements_ for partition stats files. When producing them, what must be 
true for them to be reliable information for engines?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

