ajantha-bhat commented on code in PR #7105:
URL: https://github.com/apache/iceberg/pull/7105#discussion_r1136560951


##########
format/spec.md:
##########
@@ -702,6 +703,21 @@ Blob metadata is a struct with the following fields:
 | _optional_ | _optional_ | **`properties`** | `map<string, string>` | 
Additional properties associated with the statistic. Subset of Blob properties 
in the Puffin file. |
 
 
+#### Partition statistics
+
+Partition statistics files are the valid files based on [Partition statistics 
spec](../partition-statistics-spec). Partition statistics are informational. A 
reader can choose to
+ignore partition statistics information. Partition statistics support is not 
required to read the table correctly. A table can contain
+many partition statistics files associated with different table snapshots.
+
+Partition statistics files metadata within `partition-statistics` table 
metadata field is a struct with the following fields:
+
+| v1 | v2 | Field name                 | Type     | Description                
                                                                           |
+|----|----|----------------------------|----------|-------------------------------------------------------------------------------------------------------|
+| _required_ | _required_ | **`snapshot-id`**          | `long`   | ID of the 
Iceberg table's snapshot the partition statistics file is associated with.      
            |
+| _required_ | _required_ | **`statistics-file-path`** | `string` | Path of 
the partition statistics file. See [Partition statistics 
spec](../partition-statistics-spec). |
+| _required_ | _required_ | **`sequence-number`**      | `long`   | Sequence 
number of the Iceberg table's snapshot the partition statistics was computed 
from.           |

Review Comment:
   I might still have some gaps in understanding sequence numbers fully.
   
   But I see that Puffin also saves it. So, I followed the same.
   
https://github.com/apache/iceberg/blob/master/format/spec.md#table-statistics 
   
   Do you have any suggestions for this? 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to