liurenjie1024 commented on PR #960: URL: https://github.com/apache/iceberg-rust/pull/960#issuecomment-2655880166
> So in summary, for this feature, what we would like to provide is a parquet_files_to_data_files in the arrow module(or create a parquet module). The parquet_files_to_data_files actually does two things: > > schema compatibility check > metrics collection (we can derive a function data_file_statistics_from_parquet_metadata for this which can be reused in parquet file writer) This sounds reasonable to me. After we have this function, could extend `FastAppendAction` to add existing parquet file. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org