[GitHub] [iceberg] Fokko opened a new issue, #6945: Python: Inconsistency around timezones

via GitHub Mon, 27 Feb 2023 00:39:47 -0800


Fokko opened a new issue, #6945:
URL: https://github.com/apache/iceberg/issues/6945


   ### Apache Iceberg version
   
   main (development)
   
   ### Query engine
   
   Other
   
   ### Please describe the bug 🐞
   
   With PyIceberg, when a filter eliminates every row of a DataFile, we end up with:
   ```
   ArrowInvalid: Schema at index 1 was different: 
   vendor_id: int32
   pickup_time: timestamp[us, tz=+00:00]
   pickup_location_id: int32
   dropoff_time: timestamp[us, tz=+00:00]
   dropoff_location_id: int32
   passenger_count: int32
   trip_distance: double
   ratecode_id: int32
   payment_type: int32
   total_amount: double
   fare_amount: double
   tip_amount: double
   tolls_amount: double
   mta_tax: double
   improvement_surcharge: double
   congestion_surcharge: double
   extra_surcharges: double
   store_and_forward_flag: string
   vs
   vendor_id: int32
   pickup_time: timestamp[us, tz=UTC]
   pickup_location_id: int32
   dropoff_time: timestamp[us, tz=UTC]
   dropoff_location_id: int32
   passenger_count: int32
   trip_distance: double
   ratecode_id: int32
   payment_type: int32
   total_amount: double
   fare_amount: double
   tip_amount: double
   tolls_amount: double
   mta_tax: double
   improvement_surcharge: double
   congestion_surcharge: double
   extra_surcharges: double
   store_and_forward_flag: string
   ```
   
   The empty tables that we're `concat`'ing carry the timezone `+00:00`, while the ones that actually contain data carry `UTC`:
   
![image](https://user-images.githubusercontent.com/1134248/221514436-9ad1e256-9567-4e4a-8524-8acb8ed62b77.png)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


