Fokko opened a new issue, #23:
URL: https://github.com/apache/iceberg-python/issues/23

   ### Feature Request / Improvement
   
   ### Feature Request / Improvement
   
   This is a placeholder ticket for implementing write support for PyIceberg.
   
   Since we don't want PyIceberg to write the actual data, and only handle the 
metadata part of the Iceberg table format, we need to get an overview of the 
frameworks we most likely want to integrate with (PyArrow, Dask (fastparquet?), 
etc).
   
   I would suggest the following first steps to keep it simple: Write using 
PyArrow (since that's the most commonly used FileIO) and start with 
unpartitioned tables.
   
   What we need:
   
   - [x] Avro write support: https://github.com/apache/iceberg/issues/7255
   - [x] Write files and extract statistics: 
https://github.com/apache/iceberg/issues/7256
   - [ ] Ability to alter the Manifest JSON: 
https://github.com/apache/iceberg-python/issues/22
   - [x] Proper integration tests between Java and Python: 
https://github.com/apache/iceberg/pull/6398
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to