greenlaw commented on issue #13218: URL: https://github.com/apache/iceberg/issues/13218#issuecomment-3212150378
I am hitting the same issue, and it broke my workflow after upgrading to Iceberg Java v1.9.x. Prior to the visitor-pattern metrics-collection changes in #12496 (cc @rdblue), I was able to call `ParquetUtil.fileMetrics(inputFile, metricsConfig);` to gather field statistics on any Parquet file, but now it seems to rely on the presence of the Iceberg schema / field ID metadata in the Parquet file itself. The easiest workaround for my use case seems to be to bypass the `ParquetUtil` class entirely, construct the `MessageType` and `Schema` instances manually, and call `ParquetMetrics.metrics()` directly. But it would be great to know whether a fix that restores the ability to register externally created Parquet files with Iceberg is on the roadmap.
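For context on the workaround described above: a Parquet file written outside Iceberg carries column names but no Iceberg field IDs, so building the schema by hand means choosing an ID for each column. The following is a minimal, library-free sketch of one possible policy, positional IDs starting at 1. The class `FallbackFieldIds` and its `assign` method are hypothetical names for illustration, not Iceberg or Parquet APIs, and the positional policy itself is an assumption; real code would carry these IDs into the manually built `MessageType` and `Schema`.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: none of these names are Iceberg APIs.
final class FallbackFieldIds {
    private FallbackFieldIds() {}

    // Assigns IDs 1..n in column order, as a caller might for a file with no field IDs.
    static Map<String, Integer> assign(List<String> columnNames) {
        Map<String, Integer> ids = new LinkedHashMap<>();
        int nextId = 1;
        for (String name : columnNames) {
            // putIfAbsent returns the previous value when the name was already seen.
            if (ids.putIfAbsent(name, nextId) != null) {
                throw new IllegalArgumentException("Duplicate column: " + name);
            }
            nextId++;
        }
        return ids;
    }
}
```

Calling `FallbackFieldIds.assign(List.of("id", "name", "ts"))` maps the three columns to IDs 1, 2, and 3 in that order. Positional assignment only stays stable if the column order of the external files never changes, which is why a name-based mapping may be preferable for long-lived tables.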
