singhpk234 opened a new pull request, #7531:
URL: https://github.com/apache/iceberg/pull/7531

   ### About the change
   
   Currently, writing Parquet files with brotli as the compression codec fails with:
   
   ```
   Caused by: org.apache.parquet.hadoop.BadConfigurationException: Class org.apache.hadoop.io.compress.BrotliCodec was not found
        at org.apache.parquet.hadoop.CodecFactory.getCodec(CodecFactory.java:243)
        at org.apache.parquet.hadoop.CodecFactory$HeapBytesCompressor.<init>(CodecFactory.java:144)
        at org.apache.parquet.hadoop.CodecFactory.createCompressor(CodecFactory.java:208)
        at org.apache.parquet.hadoop.CodecFactory.getCompressor(CodecFactory.java:191)
   ```
   
   This change makes sure BrotliCodec is available on the classpath so that it can be loaded and the write no longer fails with the error above.
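
For anyone who wants to confirm whether the codec is visible before attempting a write, here is a minimal sketch of a classpath check. The class name comes from the stack trace above; the `BrotliCodecCheck` helper and its method names are hypothetical and not part of Iceberg or Parquet, and Parquet's own lookup may go through a different class loader, so treat this as a rough diagnostic only.

```java
public class BrotliCodecCheck {
    // Class name taken from the stack trace in this PR description.
    static final String BROTLI_CODEC_CLASS = "org.apache.hadoop.io.compress.BrotliCodec";

    /** Returns true if the named class can be located by this class's class loader. */
    public static boolean isClassAvailable(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers
            Class.forName(className, false, BrotliCodecCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        if (isClassAvailable(BROTLI_CODEC_CLASS)) {
            System.out.println("BrotliCodec found on the classpath");
        } else {
            System.out.println("BrotliCodec missing; Parquet writes with brotli would likely fail");
        }
    }
}
```

Running `main` with the same classpath as the writer job gives a quick indication of whether the dependency is present.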
   
   Alternatively, rather than taking a new dependency, we could document in Iceberg that BrotliCodec must be installed when brotli is selected as the compression codec, much as Spark does (https://spark.apache.org/docs/latest/sql-data-sources-parquet.html, ref: brotli requires BrotliCodec to be installed). Note that the repo for BrotliCodec was created by Ryan himself :)
   
   cc @rdblue @jackye1995 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

