Fokko opened a new pull request, #5338:
URL: https://github.com/apache/iceberg/pull/5338

   When building the Manifest mappers for Python, @rdblue  noticed that the 
`added_data_files_count` should be `added_files_count` according to the spec. 
   
   However, this field is written in Java as `added_data_files_count` to Avro 
by the Java implementation:  
   
https://github.com/apache/iceberg/blob/8104769f81ba79338fd3c94d5bd9267f22d31ed7/api/src/main/java/org/apache/iceberg/ManifestFile.java#L44-L49
   
   Luckily this doesn't affect the reading/writing because it is position based.
   
   However, it is confusing. I think we should resolve this. We could either do 
this by changing the Java impl, which probably works, but we could also change 
the spec. I know that this isn't something lightweight, but we could take it 
into consideration.
   
   I think we should also update the references in the code to 
`{added,existing,deleted}_data_files_count` to make everything consistent and 
avoid confusion in the future.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to