Re: [PR] create_table with PyArrow Schema [iceberg-python]

via GitHub Thu, 25 Jan 2024 11:00:32 -0800


syun64 commented on code in PR #305:
URL: https://github.com/apache/iceberg-python/pull/305#discussion_r1466788214



##########
pyiceberg/io/pyarrow.py:
##########
@@ -906,6 +986,76 @@ def after_map_value(self, element: pa.Field) -> None:
         self._field_names.pop()
 
 
+class 
_ConvertToIcebergWithFreshIds(PreOrderPyArrowSchemaVisitor[Union[IcebergType, 
Schema]]):

Review Comment:
   Is there a reason why we decided on this approach to 
assign_fresh_schema_ids, over using a simple pre-order traversal to assign IDs 
as we see the fields? Is this because we want to accommodate for potential 
duplicates?
   
   The usage of **reserved_ids** in _SetFreshIDs is what I'm referring to:
   
   
https://github.com/apache/iceberg-python/blob/0f08806d4431d5d60998dac1bca5780b6d2e2785/pyiceberg/schema.py#L1224-L1256



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Re: [PR] create_table with PyArrow Schema [iceberg-python]

Reply via email to