Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-18 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2889491677 Once PyArrow 21.0.0 is released, we will be able to test the above case. Until then, I will wait.

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-12 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2874603831 @kevinjqliu There is nothing. I ran the gsutil ls command beforehand, and the seoul_data_v1/warehouse/test/ directory did not exist. As you know, 0-6f9631fe-18

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-09 Thread via GitHub
kevinjqliu commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2867856930 hmmm. [this](https://github.com/apache/arrow/blob/067fd2a2c6e54d33b9ae8a3324f59bebe960d485/cpp/src/arrow/filesystem/gcsfs.cc#L351-L360) is saying that the `location` is a

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-06 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2856146318 @kevinjqliu location = 'gs://seoul_data_v1/warehouse/test/metadata/0-6f9631fe-18db-4743-915b-36d7562d4073.metadata.json'

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-06 Thread via GitHub
kevinjqliu commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2855107815 Thanks for the debug log! > FileInfo for location : type=FileType.Directory is definitely the issue. We [check if the FileType is NotFound](https://github
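To illustrate why a `FileType.Directory` result matters for a "not found" style check, here is a minimal sketch. It does not reproduce PyIceberg's or PyArrow's code: the `FileType` enum below is a hypothetical stand-in that only borrows the member names of `pyarrow.fs.FileType` (the numeric values are arbitrary), and `exists_loose` and `exists_as_file` are illustrative helper names, not functions from either library.

```python
from enum import Enum


class FileType(Enum):
    # Hypothetical stand-in: same member names as pyarrow.fs.FileType,
    # values chosen arbitrarily for this sketch.
    NotFound = 0
    Unknown = 1
    File = 2
    Directory = 3


def exists_loose(file_type: FileType) -> bool:
    """Treat anything other than NotFound as 'exists' (illustrative helper)."""
    return file_type != FileType.NotFound


def exists_as_file(file_type: FileType) -> bool:
    """Treat only a regular file as 'exists' (illustrative helper)."""
    return file_type == FileType.File


# A location reported as a directory passes the loose check
# but fails the stricter file-only check.
reported = FileType.Directory
print(exists_loose(reported), exists_as_file(reported))
```

Under these assumptions, a metadata location that the filesystem reports as a directory would look like an existing path to the loose check even though no file was ever written there.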

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-05 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2853358371 My ec2 is provisioned on GCP. Basically they set the gcloud account owner. So I can view the bucket inside. ![Image](https://github.com/user-attachments/assets/ae64

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-05 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2852821883 Sure @kevinjqliu :) In the image below, the top of the log shows the return type. FileInfo for location : type=FileType.Directory https://github.com/user-attachme

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-05 Thread via GitHub
kevinjqliu commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2851881539 Interesting, so [`self.exists()`](https://github.com/apache/iceberg-python/blob/34c89494c39916b9b1aa7e6da2c24c34c4d7f058/pyiceberg/io/pyarrow.py#L344) here returns `True`.

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-05 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2851220164 I think this might be the clue that’s worth digging into further: https://github.com/apache/arrow/blob/main/cpp/src/arrow/filesystem/gcsfs.cc ``` c++ Result Ge

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-05 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2851040172 Hi, @kevinjqliu :) Thank you for summarizing. > Looking at the screenshot. It looks like you're running create_table with a GCS table location. On table creation, the

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-04 Thread via GitHub
kevinjqliu commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2849293475 Hi @HyunWooZZ thanks for raising this issue! I want to summarize my understanding > I had a problem writing an Iceberg table to GCS via the JDBC catalog, but I solve

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-02 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2847423297 I added some PyIceberg code that retrieves file information from the Arrow file system to make it easier for you to find. [iceberg-python.pyiceberg.io.pyarrow.py](ht

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-02 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2847405490 Thanks for replying, @Fokko! Yes :) I'm definitely interested in fixing the above. Yesterday, I found something related to the above log: However, is it our resp

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-05-02 Thread via GitHub
Fokko commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2847042380 Thanks for raising this @HyunWooZZ! 🙌 Are you interested in providing a fix?

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-04-28 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2836984873 I found where that file location information comes from! https://github.com/fsspec/filesystem_spec/blob/master/fsspec/spec.py#L97 ![Image](https://github.com/u

Re: [I] Clarify Error Logs for GCS Directory/Object Conflicts [iceberg-python]

2025-04-28 Thread via GitHub
HyunWooZZ commented on issue #1952: URL: https://github.com/apache/iceberg-python/issues/1952#issuecomment-2836935894 Also, it is odd that an object storage engine reports a directory type at all.
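As background on how a "directory" can appear in an object store with a flat key namespace, here is a generic sketch. It is an assumption-laden illustration of prefix-based inference, not the logic of Arrow's `gcsfs.cc` or of GCS itself; `infer_type` and the sample keys are hypothetical.

```python
def infer_type(keys: set[str], path: str) -> str:
    """Classify a path against a flat set of object keys (illustrative only)."""
    # An exact key match is an object, i.e. a "file".
    if path in keys:
        return "File"
    # No real directories exist; one is inferred when some key
    # starts with the path followed by a slash.
    prefix = path.rstrip("/") + "/"
    if any(key.startswith(prefix) for key in keys):
        return "Directory"
    return "NotFound"


# Hypothetical bucket contents: a single object under a nested prefix.
keys = {"warehouse/test/metadata/0.metadata.json"}

print(infer_type(keys, "warehouse/test"))
print(infer_type(keys, "warehouse/test/metadata/0.metadata.json"))
print(infer_type(keys, "warehouse/other"))
```

In this model a directory type is only a view derived from key prefixes, which is one way a filesystem layer might report `Directory` for a path that was never created explicitly.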