mdub commented on issue #9898:
URL: https://github.com/apache/iceberg/issues/9898#issuecomment-2436981886

   I've also been experimenting with Bloom filters, and managed to get it 
working fairly easily with a nested field:
   
   ```
   ALTER TABLE glue_catalog.kafka_archive.test_topic
   SET TBLPROPERTIES 
('write.parquet.bloom-filter-enabled.column.kafka_metadata.key'='true')
   ```
   
   Then, after downloading a sample data file:
   
   ```
   $ parquet bloom-filter -c kafka_metadata.key -v foo,4519160c-7d7c-44ae,bar 
/tmp/00001.parquet
   
   Row group 0: 
   
--------------------------------------------------------------------------------
   value foo NOT exists.
   value 4519160c-7d7c-44ae maybe exists.
   value bar NOT exists.
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to