andormarkus commented on PR #1742: URL: https://github.com/apache/iceberg-python/pull/1742#issuecomment-2695267357
We want to avoid `streaming` and `Spark` at all costs; we have been burned by both. I have no problem with Avro (manifest files) combined with `append_data_file`. Yes, the current PR is more or less obsolete, because I think `append_data_file` is the better approach. I like your #1678, but it would create too many commits. We implemented this [suggestion](https://github.com/apache/iceberg-python/issues/1084#issuecomment-2423927862), but it created too many commits, which was a real performance killer. I will close this PR and the related issue and reopen everything as a distributed write.