kevinjqliu opened a new issue, #14925:
URL: https://github.com/apache/iceberg/issues/14925

   ### Apache Iceberg version
   
   None
   
   ### Query engine
   
   None
   
   ### Please describe the bug 🐞
   
   Relates to #14914
   
   In the ["Writing data files" 
section](https://iceberg.apache.org/spec/#writing-data-files) of the spec, it 
mentions that 
   """
   All columns **must be written to data files** even if they introduce 
redundancy with metadata stored in manifest files (e.g. columns with identity 
partition transforms).
   """
   
   However, in the ["Column Projection" 
section](https://iceberg.apache.org/spec/#column-projection) of the spec, the 
partition value **can be** missing from the data files for identity transforms. 
The identity partition values can be projected by the reader to support the 
hive migration use case. 
   
   We should clarify the "Writing data files" and mention this particular edge 
case 
   
   ### Willingness to contribute
   
   - [ ] I can contribute a fix for this bug independently
   - [ ] I would be willing to contribute a fix for this bug with guidance from 
the Iceberg community
   - [ ] I cannot contribute a fix for this bug at this time


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to