huaxiangsun opened a new issue, #12770:
URL: https://github.com/apache/iceberg/issues/12770

   ### Feature Request / Improvement
   
   We are creating tables with very large columns (up to hundreds of MB). 
These columns normally hold raw bytes, for which min/max statistics do not 
make sense. Today, Parquet column statistics are enabled by default for the 
data files of Iceberg tables, and the statistics for these large columns 
consume a lot of memory. More background is in the Parquet link below.
   
   https://github.com/apache/parquet-java/issues/2521
   
   Similar to the existing metrics configuration, we need table properties to 
enable or disable Parquet column statistics.
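
   As a rough illustration of the request, the sketch below resolves a per-column on/off switch from a table-properties map, with a table-wide default and per-column overrides. The property names `write.parquet.stats-enabled.default` and `write.parquet.stats-enabled.column.<name>`, the helper `stats_enabled`, and the column names are all hypothetical, chosen only to mirror the style of the per-column metrics settings. This is not an existing Iceberg or Parquet API.

```python
# Hypothetical property names, modeled on the style of per-column metrics settings.
DEFAULT_KEY = "write.parquet.stats-enabled.default"
COLUMN_PREFIX = "write.parquet.stats-enabled.column."


def stats_enabled(props, column):
    """Return True if Parquet min/max statistics should be written for `column`."""

    def parse(value):
        return value.strip().lower() == "true"

    # Statistics stay on unless the table-wide default says otherwise.
    default = parse(props.get(DEFAULT_KEY, "true"))
    # A per-column entry, when present, overrides the default.
    override = props.get(COLUMN_PREFIX + column)
    return default if override is None else parse(override)


# Disable statistics only for a large binary column (hypothetical name).
table_props = {COLUMN_PREFIX + "payload": "false"}
print(stats_enabled(table_props, "payload"))
print(stats_enabled(table_props, "id"))
```

   Under these assumptions, the large `payload` column is skipped while every other column keeps the current default behavior, so existing tables would be unaffected unless they opt out.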
   
   ### Query engine
   
   None
   
   ### Willingness to contribute
   
   - [x] I can contribute this improvement/feature independently
   - [ ] I would be willing to contribute this improvement/feature with 
guidance from the Iceberg community
   - [ ] I cannot contribute this improvement/feature at this time

