subthedubdub opened a new issue, #1470:
URL: https://github.com/apache/iceberg-python/issues/1470
   ### Feature Request / Improvement
   
   Support a table transaction where the user can directly supply the file 
metadata, similar to the [Java 
Interface](https://javadoc.io/doc/org.apache.iceberg/iceberg-api/latest/org/apache/iceberg/AppendFiles.html).
   
   This is a lower-level operation than what the `add_files` operation covers, 
and it supports several additional use cases. For example:
   
   1. **Implicit Partitioning**: Iceberg does not require that partition values 
actually be stored in the Parquet files themselves. In some cases, it may be 
useful to define the partition value of a Parquet file after it has been written 
(or to intentionally exclude it, e.g. to minimize storage). However, `add_files` 
does not currently support this.
   2. **Pre-calculated statistics**: Calculating Parquet statistics (e.g. 
column bounds) can be a somewhat resource-intensive operation. If these have 
already been calculated by the user, it would be faster to update the manifest file 
directly.
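
As a rough illustration of the two use cases above, here is a minimal sketch of what user-supplied file metadata could look like. Every name in it (`DataFileSpec`, `AppendFilesSketch`, `append_file`, the example path and column names) is hypothetical and is not part of the PyIceberg API; it only shows the shape of the request, where partition values and column bounds come from the caller rather than from reading the Parquet file.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass(frozen=True)
class DataFileSpec:
    """Hypothetical user-supplied file metadata (not a PyIceberg class)."""

    file_path: str
    record_count: int
    file_size_in_bytes: int
    # Partition values supplied by the caller; they need not exist as columns in the file.
    partition: Dict[str, Any] = field(default_factory=dict)
    # Pre-calculated column bounds, keyed by column name for readability.
    lower_bounds: Dict[str, Any] = field(default_factory=dict)
    upper_bounds: Dict[str, Any] = field(default_factory=dict)


class AppendFilesSketch:
    """Hypothetical stand-in for a transaction-level append operation."""

    def __init__(self, partition_fields: List[str]) -> None:
        self.partition_fields = partition_fields
        self.pending: List[DataFileSpec] = []

    def append_file(self, data_file: DataFileSpec) -> "AppendFilesSketch":
        # Only check that every partition field has a value; the file is never opened.
        missing = [f for f in self.partition_fields if f not in data_file.partition]
        if missing:
            raise ValueError(f"missing partition values: {missing}")
        self.pending.append(data_file)
        return self


# Example usage with made-up values.
append = AppendFilesSketch(partition_fields=["event_date"])
append.append_file(
    DataFileSpec(
        file_path="s3://bucket/events/part-0000.parquet",
        record_count=1000,
        file_size_in_bytes=52_428,
        partition={"event_date": "2024-12-01"},
        lower_bounds={"user_id": 1},
        upper_bounds={"user_id": 9876},
    )
)
```

In this sketch the pending list stands in for whatever a real implementation would write to the manifest on commit; how that maps onto PyIceberg's transaction internals is left open.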


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

