[I] Getting "offset overflow while concatenating arrays" Error when writing to iceberg [iceberg-python]

via GitHub Thu, 27 Feb 2025 01:16:19 -0800


heman026 opened a new issue, #1733:
URL: https://github.com/apache/iceberg-python/issues/1733


   ### Question
   
   I have read a parquet file and loaded into pyarrow table using read_table 
method. I want to write this arrow table to iceberg table. But I am getting the 
following error 
   
   > "pyarrow.lib.ArrowInvalid: offset overflow while concatenating arrays, 
consider casting input from `string` to `large_string` first.". 
   
   Below is the code I am using.    
   
   ```
   arrow_table = pq.read_table(file,coerce_int96_timestamp_unit='us')
   
   iceberg_table = return catalog.create_table_if_not_exists(iceberg_table, 
schema, partition_spec=partition_spec,
       properties={
           'downcast-ns-timestamp-to-us-on-write': True,
           PYARROW_USE_LARGE_TYPES_ON_READ: True
       })
   
   iceberg_table.overwrite(arrow_table )
   ```
   Kindly help me resolve this issue.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

[I] Getting "offset overflow while concatenating arrays" Error when writing to iceberg [iceberg-python]

Reply via email to