Re: [I] [BUG] pyiceberg hanging on multiprocessing [iceberg-python]

2025-01-09 Thread via GitHub
frankliee commented on issue #1488: URL: https://github.com/apache/iceberg-python/issues/1488#issuecomment-2581774240 I use strace on the worker process, there are `FUTEX_WAIT_BITSET_PRIVATE`, and I not sure it was dead locking caused by process forking in pyarrow. Then I find that

Re: [I] [BUG] pyiceberg hanging on multiprocessing [iceberg-python]

2025-01-09 Thread via GitHub
frankliee commented on issue #1488: URL: https://github.com/apache/iceberg-python/issues/1488#issuecomment-2579879173 > One thing we can test is to force create a new FileIO in the worker. Something like this > > ``` > from multiprocessing import Process > from pyiceberg.io.pya

Re: [I] [BUG] pyiceberg hanging on multiprocessing [iceberg-python]

2025-01-07 Thread via GitHub
kevinjqliu commented on issue #1488: URL: https://github.com/apache/iceberg-python/issues/1488#issuecomment-2575734864 One thing we can test is to force create a new FileIO in the worker. Something like this ``` from multiprocessing import Process from pyiceberg.io.pyarrow im

Re: [I] [BUG] pyiceberg hanging on multiprocessing [iceberg-python]

2025-01-06 Thread via GitHub
frankliee commented on issue #1488: URL: https://github.com/apache/iceberg-python/issues/1488#issuecomment-2574263503 @kevinjqliu I use pystack to get stack of child process, it shows that `pyarrow.FileSystem` causes the hanging. By the way, our env is not easy to upgrade pyiceberg t

Re: [I] [BUG] pyiceberg hanging on multiprocessing [iceberg-python]

2025-01-06 Thread via GitHub
kevinjqliu commented on issue #1488: URL: https://github.com/apache/iceberg-python/issues/1488#issuecomment-2573279503 Hi @frankliee thanks for reporting this issue. I noticed you're using version 0.7.1, the latest version is 0.8.1. Could you retry with the latest version? The issue