RussellSpitzer commented on code in PR #16025:
URL: https://github.com/apache/iceberg/pull/16025#discussion_r3227383216


##########
format/spec.md:
##########
@@ -75,9 +75,9 @@ This table format tracks individual data files in a table 
instead of directories
 
 Table state is maintained in metadata files. All changes to table state create 
a new metadata file and replace the old metadata with an atomic swap. The table 
metadata file tracks the table schema, partitioning config, custom properties, 
and snapshots of the table contents. A snapshot represents the state of a table 
at some time and is used to access the complete set of data files in the table.
 
-Data files in snapshots are tracked by one or more manifest files that contain 
a row for each data file in the table, the file's partition data, and its 
metrics. The data in a snapshot is the union of all files in its manifests. 
Manifest files are reused across snapshots to avoid rewriting metadata that is 
slow-changing. Manifests can track data files with any subset of a table and 
are not associated with partitions.

Review Comment:
   Do we want to add anything up front about V4 unifying this structure? I just 
note that you extended this paragraph to say Data Manifests and Delete 
Manifests, but now we will only have one type of manifest



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to