tomtongue opened a new pull request, #9096: URL: https://github.com/apache/iceberg/pull/9096
The parquet default compression codec is still `gzip` in the doc. I tested the Iceberg 1.4.1 with Spark, and it's `zstd` from my check: ``` # DESCRIBE EXTENDED TABLE ... |Provider |iceberg | | |Owner |spark | | |Table Properties |[current-snapshot-id=5229490619909685802,format=iceberg/parquet,format-version=2,write.metadata.compression-codec=gzip,write.parquet.compression-codec=zstd]| | +----------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------+-------+ ``` A Parquet file that was written by the Spark app: ``` $ parquet footer 00000-0-81cf5185-9ecd-424f-9f65-f93a84f3e390-00001.parquet | grep codec | head -1 "codec" : "ZSTD", ``` In addition to my check, the following release notes say: https://iceberg.apache.org/releases/ > Use zstd compression for Parquet by default in new tables ([#8593](https://github.com/apache/iceberg/pull/8593)) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org