Fokko opened a new issue, #23: URL: https://github.com/apache/iceberg-python/issues/23
### Feature Request / Improvement ### Feature Request / Improvement This is a placeholder ticket for implementing write support for PyIceberg. Since we don't want PyIceberg to write the actual data, and only handle the metadata part of the Iceberg table format, we need to get an overview of the frameworks we most likely want to integrate with (PyArrow, Dask (fastparquet?), etc). I would suggest the following first steps to keep it simple: Write using PyArrow (since that's the most commonly used FileIO) and start with unpartitioned tables. What we need: - [x] Avro write support: https://github.com/apache/iceberg/issues/7255 - [x] Write files and extract statistics: https://github.com/apache/iceberg/issues/7256 - [ ] Ability to alter the Manifest JSON: https://github.com/apache/iceberg-python/issues/22 - [x] Proper integration tests between Java and Python: https://github.com/apache/iceberg/pull/6398 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org