Re: [I] [CI] Modernize AppVeyor build [arrow]

2024-12-12 Thread via GitHub
pitrou closed issue #44997: [CI] Modernize AppVeyor build URL: https://github.com/apache/arrow/issues/44997 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-ma

Re: [I] [CI] Modernize AppVeyor build [arrow]

2024-12-12 Thread via GitHub
pitrou closed issue #44997: [CI] Modernize AppVeyor build URL: https://github.com/apache/arrow/issues/44997 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-ma

Re: [I] [CI][Python] `pyarrow.lib` fails loading on AppVeyor [arrow]

2024-12-12 Thread via GitHub
pitrou closed issue #44991: [CI][Python] `pyarrow.lib` fails loading on AppVeyor URL: https://github.com/apache/arrow/issues/44991 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[I] [CI][Python] Test failures in example builds [arrow]

2024-12-12 Thread via GitHub
pitrou opened a new issue, #45006: URL: https://github.com/apache/arrow/issues/45006 ### Describe the bug, including details regarding any error messages, version, and platform. Some nightly Python builds have started failing after https://github.com/apache/arrow/pull/44951 , for exa

[I] Add Fixed-Size List Support in Apache Arrow STL Conversion Traits [arrow]

2024-12-12 Thread via GitHub
mroz45 opened a new issue, #45005: URL: https://github.com/apache/arrow/issues/45005 ### Describe the enhancement requested Conversion of a range of std::tuple objects can be converted into Arrow Tables. However, while Arrow supports several standard C++ data types for this conversio

[I] iter_batches on a parquet file with zero row groups fails in pyarrow>=18 [arrow]

2024-12-12 Thread via GitHub
cedriccuypers opened a new issue, #45009: URL: https://github.com/apache/arrow/issues/45009 ### Describe the bug, including details regarding any error messages, version, and platform. We noticed a bug in pyarrow when we were trying to iterate in batches over parquet files, of which

[I] [pyarrow] `pyarrow.unique` gives garbage results with chunked dictionary arrays. [arrow]

2024-12-12 Thread via GitHub
Yeshwanth-G opened a new issue, #45010: URL: https://github.com/apache/arrow/issues/45010 ### Describe the bug, including details regarding any error messages, version, and platform. Consider: ``` In [9]: pyarrow.__version__ Out[9]: '18.0.0' In [10]: a = pa.DictionaryArray

[I] [CI][Python] mimalloc warnings in debug memory pool test on 32-bit Linux [arrow]

2024-12-12 Thread via GitHub
pitrou opened a new issue, #45011: URL: https://github.com/apache/arrow/issues/45011 ### Describe the bug, including details regarding any error messages, version, and platform. This failure should be investigated as it may point to a bogus memory allocation somewhere in Arrow: h

[I] [C++][Parquet] Allow increasing default footer read size [arrow]

2024-12-12 Thread via GitHub
zeroshade opened a new issue, #45015: URL: https://github.com/apache/arrow/issues/45015 ### Describe the enhancement requested The Parquet library currently requires multiple round trips to read the footer of a file whose metadata is >64KB. For high latency file systems like S3, it c

[I] [Pyarrow] Scanner crashing occasionally [arrow]

2024-12-12 Thread via GitHub
alexeykudinkin opened a new issue, #45017: URL: https://github.com/apache/arrow/issues/45017 ### Describe the bug, including details regarding any error messages, version, and platform. Occasionally seeing Pyarrow's `Scanner` resulting in following worker crashes in Ray Data:

[I] R: `write_adbc()` does not seem to work when writing to a Snowflake table [arrow-adbc]

2024-12-12 Thread via GitHub
liamnz opened a new issue, #2366: URL: https://github.com/apache/arrow-adbc/issues/2366 ### What happened? I'm using the `adbcdrivermanager` and `adbcsnowflake` packages in R and testing out writing a data-frame to a Snowflake table with `write_adbc()`. `write_adbc()` returns a

[I] [Java][CI] Building C++ libraries on Ubuntu aarch64 job takes 3+ hours to complete in java-jars [arrow-java]

2024-12-12 Thread via GitHub
danepitkin opened a new issue, #457: URL: https://github.com/apache/arrow-java/issues/457 ### Describe the enhancement requested Can we speed this up? Other OS/architectures take 10-30min. See java-jars job in crossbow: https://github.com/ursacomputing/crossbow/actions/runs/981

[I] Is there a max limit to PyArrow write_dataset row-group sizes [arrow]

2024-12-12 Thread via GitHub
theogaraj opened a new issue, #45018: URL: https://github.com/apache/arrow/issues/45018 ### Describe the usage question you have. Please include as many useful details as possible. NOTE: I posted this to StackOverflow a few days ago, reposting here for more focused attention of