fengjiajie commented on PR #8808: URL: https://github.com/apache/iceberg/pull/8808#issuecomment-1765499071
> > Some systems like older versions of Impala do not annotate String type as UTF-8 columns in Parquet files. When importing these Parquet files into Iceberg, reading these Binary columns will encounter type errors. > > The code changes LGTM, but I wonder whether this issue isn't something that should be fixed in Impala itself? Also curious what other think about fixing this in Iceberg @pvary @stevenzwu @Fokko Hi @nastra, thank you for the feedback. Impala is just an example. There are other Parquet data generated by various systems over the years that we want to add to the Iceberg table and read using Flink. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org