rshkv opened a new pull request, #863:
URL: https://github.com/apache/iceberg-rust/pull/863

   Re #823. This adds support for the the [Manifest Entries 
(docs)](https://iceberg.apache.org/docs/latest/spark-queries/#entries) which 
lists entries in the current snapshot's manifest files.
   
   I'm putting this up as draft because I want to first rebase on #841 which 
will reduce the diff (it looks like I'm adding all of `metadata_table.rs` but 
it's just a rename from `metadata_scan.rs` per [this 
comment](https://github.com/apache/iceberg-rust/pull/822#discussion_r1899617626)).
   
   This PR ended up being quite verbose because `arrow-rs` is strict about 
declaring types of built arrays at compile time which is `pyarrow` is more 
lenient about. That's hard to do when we have struct with partition values and 
upper/lower bounds where we only learn the type at runtime. Let me know if you 
think of a better way to handle those columns.
   
   Reference implementations:
   * [Java]
   * [PyIceberg]
   
   [Java]: 
https://github.com/apache/iceberg/blob/apache-iceberg-1.7.1/core/src/main/java/org/apache/iceberg/ManifestEntriesTable.java
   [PyIceberg]: 
https://github.com/apache/iceberg-python/blob/0e5086ceb77351bc0b6ec3a592f5eda70a0afe46/pyiceberg/table/inspect.py#L92-L213


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to