ebyhr opened a new issue, #14284: URL: https://github.com/apache/iceberg/issues/14284
### Feature Request / Improvement Hi, this is a proposal from the Trino community to decouple Iceberg from the Hadoop codebase. As some of you may already know, Trino decoupled its Hadoop codebase last year (see trinodb/trino#15921). The motivation and background for that change are explained in detail in the link above. We’ve found that Iceberg’s current dependency on Hadoop makes it harder to adopt new Iceberg features. In many cases, Trino Iceberg connector developers need to reinvent the wheel when a feature depends on the Parquet or ORC reader/writer. While this is technically possible, it often requires non-trivial effort and increases the risk of bugs or inconsistencies, which isn’t ideal for users. Since Parquet and ORC are just file formats, relying on Hadoop shouldn’t be a strict requirement. We’d like to propose supporting native readers and writers for Parquet and ORC files - similar to Trino’s approach which could simplify maintenance and make it easier for downstream projects to stay up to date with Iceberg’s latest features. Many Trino users expect new Iceberg features to become available shortly after we upgrade the library, and reducing this dependency would help make that possible. We’d love to hear your thoughts and feedback on this idea. ### Query engine Trino ### Willingness to contribute - [ ] I can contribute this improvement/feature independently - [ ] I would be willing to contribute this improvement/feature with guidance from the Iceberg community - [ ] I cannot contribute this improvement/feature at this time -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
