rdblue commented on code in PR #15049:
URL: https://github.com/apache/iceberg/pull/15049#discussion_r2898323993


##########
api/src/main/java/org/apache/iceberg/FileContent.java:
##########
@@ -18,11 +18,18 @@
  */
 package org.apache.iceberg;
 
-/** Content type stored in a file, one of DATA, POSITION_DELETES, or 
EQUALITY_DELETES. */
+/** Content type stored in a file. */
 public enum FileContent {
+  /** Data file content. Stored in data manifests since v1. */

Review Comment:
   I think that the docs here are too specific and aren't adding much value.
   
   Here, for example, "data file content" duplicates "file content". A more 
useful description is "A data file", but I don't think that is really 
necessary. The content of the file this is attached to is "data" -- that's 
already clear.
   
   The additional symbols are also good: file content is a "data manifest", or 
file content is a "delete manifest".
   
   Also, this was not stored in data manifests since v1. All files in v1 were 
data files, but this was not stored anywhere. This is one reason why 
over-specifying in docs is a problem. The other problem is that you have to 
remember to come back here and update it if you include requirements or 
assumptions. The best practice is it be very simple and not add anything you 
don't need to.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to