soumya-ghosh commented on issue #1053: URL: https://github.com/apache/iceberg-python/issues/1053#issuecomment-2646624859
I understand that `files` table by snapshot and `all_files` (and its derivatives) should respect schema evolution. The keys in column of `readable_metrics` is derived from schema, thus the source of inconsistency. I did a test to see the behavior in Spark, observations in [gist](https://gist.github.com/soumya-ghosh/77bccf0fe77926da0b6e96432021879a). It appears that in Spark constructs the `readable_metrics` column by considering the current schema (which maybe a bug). Thoughts @kevinjqliu ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org