tlegrave commented on issue #895: URL: https://github.com/apache/iceberg-python/issues/895#issuecomment-2222175740
Thank you for your reply @syun64. > On a tangent, I see that this property is still [unsupported in PyIceberg for writes](https://github.com/apache/iceberg-python/blob/77a07c90b7ca05c5d915c5c02047807c76b5031e/pyiceberg/io/pyarrow.py#L2054-L2064) - just wanted to make note of that before you dived in deeper into trying to enable this flag for the table Yes, I'm aware of this limitation, and for now we only use pyiceberg to create tables. Write operations are done with spark to handle this bloom filter feature and to bypass some other known limitations of pyiceberg. Regarding the issue, I made some further tests based on what you mentionned. This code also ignores the bloom filter (although passing directly "True" as string). Would it be a case sensitive issue? ```py catalog.create_table( "default.example", schema=Schema( NestedField(1, "created_at", TimestampType(), required=True) ), properties={"write.parquet.bloom-filter-enabled.column.material_id": "True"}, ) ``` For info, I'm extracting the schema after creation of the table doing a `SHOW create table catalog.default.example` on Trino. Thanks -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org