[I] Snowflake driver logs at info level and perhaps could log fewer messages [arrow-adbc]

2025-03-11 Thread via GitHub
paleolimbot opened a new issue, #2607: URL: https://github.com/apache/arrow-adbc/issues/2607 ### What feature or improvement would you like to see? In we see R CMD check failures for the snowflake driver (probably) because of a log message emitted during the examples (and/or possibly

[I] [R][Packaging] Update R packaging checklist with latest process [arrow]

2025-03-11 Thread via GitHub
amoeba opened a new issue, #45691: URL: https://github.com/apache/arrow/issues/45691 ### Describe the enhancement requested It's been a few releases since we reviewed the R packaging checklist and I see a few things we should update. ### Component(s) R -- This is an a

Re: [I] [C++] Bump Apache ORC to 2.1.1 [arrow]

2025-03-11 Thread via GitHub
wgtmac closed issue #45689: [C++] Bump Apache ORC to 2.1.1 URL: https://github.com/apache/arrow/issues/45689 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-m

[I] [C++] Pass shared_ptr by value to parametric type constructors [arrow]

2025-03-11 Thread via GitHub
felipecrv opened a new issue, #37891: URL: https://github.com/apache/arrow/issues/37891 ### Describe the enhancement requested Types like `ListType` and factory functions like `list` take a `const std::shared_ptr&` instead of a `std::shared_ptr` that could be moved into the newly con
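The trade-off this issue describes can be sketched in a few lines. This is a minimal illustration, not Arrow's code: `DataType`, `ListByConstRef`, `ListByValue`, and the two helper functions are hypothetical stand-ins. It shows only the general C++ behavior that a constructor taking a `std::shared_ptr` by value can move it into a member, while one taking a `const std::shared_ptr&` has to copy it.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical stand-in for a type object; not Arrow's DataType.
struct DataType {};

// Takes the pointer by const reference. Storing it requires a copy,
// which increments the shared reference count.
class ListByConstRef {
 public:
  explicit ListByConstRef(const std::shared_ptr<DataType>& value_type)
      : value_type_(value_type) {}
  long use_count() const { return value_type_.use_count(); }

 private:
  std::shared_ptr<DataType> value_type_;
};

// Takes the pointer by value and moves it into the member. A caller that
// passes an rvalue transfers ownership without touching the reference count.
class ListByValue {
 public:
  explicit ListByValue(std::shared_ptr<DataType> value_type)
      : value_type_(std::move(value_type)) {}
  long use_count() const { return value_type_.use_count(); }

 private:
  std::shared_ptr<DataType> value_type_;
};

// The caller keeps its own handle, so two owners exist after construction.
long UseCountAfterCopy() {
  auto type = std::make_shared<DataType>();
  ListByConstRef list(type);
  return list.use_count();
}

// The caller gives up its handle, so the new object is the only owner.
long UseCountAfterMove() {
  auto type = std::make_shared<DataType>();
  ListByValue list(std::move(type));
  assert(type == nullptr);  // a moved-from shared_ptr is empty
  return list.use_count();
}
```

Under these assumptions, `UseCountAfterCopy()` reports two owners and `UseCountAfterMove()` reports one. A caller that still needs its own handle can pass an lvalue to the by-value constructor and pay for one copy, the same cost as the const-reference form.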

Re: [I] go/adbc/driver/flightsql: performance loss in go1.22.5/go1.21.12 [arrow-adbc]

2025-03-11 Thread via GitHub
lidavidm closed issue #2021: go/adbc/driver/flightsql: performance loss in go1.22.5/go1.21.12 URL: https://github.com/apache/arrow-adbc/issues/2021

[I] add ordered and AppMetaData to FlightInfo.cs [arrow]

2025-03-11 Thread via GitHub
cmettler opened a new issue, #45752: URL: https://github.com/apache/arrow/issues/45752 ### Describe the enhancement requested The flight FieldInfo protocol defines two fields Ordered and AppMetaData which are not assigned in the Apache.Arrow.Flight.FlightInfo.cs ### Component(s

[I] [R] Review 19.0.X C++ releases and add any missing functionality to the R package [arrow]

2025-03-11 Thread via GitHub
amoeba opened a new issue, #45731: URL: https://github.com/apache/arrow/issues/45731 ### Describe the enhancement requested Before the 20.0.0 release, we should review the 19.0.X releases for any changes we missed adding bindings to in the R package so it stays in sync. Neal noted th

[I] `compute::LocalTimestamp()` Resulting in incorrect conversion [arrow]

2025-03-11 Thread via GitHub
gowerc opened a new issue, #45751: URL: https://github.com/apache/arrow/issues/45751 ### Describe the bug, including details regarding any error messages, version, and platform. Apologies in advance if I've made a mistake here I am relatively new to the arrow Cpp API and also to mana

[I] Support Variant Encoding for Parquet [arrow-go]

2025-03-11 Thread via GitHub
sfc-gh-mbojanczyk opened a new issue, #310: URL: https://github.com/apache/arrow-go/issues/310 ### Describe the enhancement requested [Variant encoding](https://github.com/apache/parquet-format/blob/master/VariantEncoding.md) looks like it's becoming part of the Parquet spec, and it'

[I] [Python] `pyarrow.compute.skew(skip_nulls=True)` still counts NULL as an observation? [arrow]

2025-03-11 Thread via GitHub
mroeschke opened a new issue, #45733: URL: https://github.com/apache/arrow/issues/45733 ### Describe the bug, including details regarding any error messages, version, and platform. ```python >>> import pyarrow as pa >>> import pyarrow.compute as pc >>> import pandas as pd, nu

[I] [C++] FixedSizeListBuilder should have UnsafeAppend methods [arrow]

2025-03-11 Thread via GitHub
pitrou opened a new issue, #45723: URL: https://github.com/apache/arrow/issues/45723 ### Describe the enhancement requested The following methods are missing from `FixedSizeListBuilder` and would be useful for high-performance fixed-size-list array building: * `void UnsafeAppen

[I] python flight client: modify 'user-agent' header from client using ClientMiddleware [arrow]

2025-03-11 Thread via GitHub
xshirax opened a new issue, #45730: URL: https://github.com/apache/arrow/issues/45730 ### Describe the usage question you have. Please include as many useful details as possible. hey i want to add additional information to the existing user-agent header in my http request.

[I] Datafusion driver crate rust-version [arrow-adbc]

2025-03-11 Thread via GitHub
mbrobbel opened a new issue, #2591: URL: https://github.com/apache/arrow-adbc/issues/2591 think we need to split workspaces for another reason: currently we claim we only need rust 1.80 but DataFusion needs rust 1.81. So I was unable to upload v0.17.0 to crates.io. ``` error: fai

[I] [C++] StructBuilder should have UnsafeAppend methods [arrow]

2025-03-11 Thread via GitHub
pitrou opened a new issue, #45722: URL: https://github.com/apache/arrow/issues/45722 ### Describe the enhancement requested The following methods are missing from `StructBuilder` and would be useful for high-performance struct array building: * `void UnsafeAppend()` * `void Unsa

Re: [I] c: use target_link_libraries instead of include_directories [arrow-adbc]

2025-03-11 Thread via GitHub
lidavidm closed issue #608: c: use target_link_libraries instead of include_directories URL: https://github.com/apache/arrow-adbc/issues/608

Re: [I] Datafusion driver crate rust-version [arrow-adbc]

2025-03-11 Thread via GitHub
lidavidm closed issue #2591: Datafusion driver crate rust-version URL: https://github.com/apache/arrow-adbc/issues/2591

Re: [I] [C++] Add possibility to extract spans/byte offsets directly for `compute.extract_regex` [arrow]

2025-03-11 Thread via GitHub
pitrou closed issue #44615: [C++] Add possibility to extract spans/byte offsets directly for `compute.extract_regex` URL: https://github.com/apache/arrow/issues/44615

[I] [C++][Python][Parquet] Support Content-Defined Chunking of Parquet files [arrow]

2025-03-11 Thread via GitHub
kszucs opened a new issue, #45750: URL: https://github.com/apache/arrow/issues/45750 ### Describe the enhancement requested ## Rationale Unlike the traditional approach where pages are split once a page's size reaches the default limit (typically 1MB), this implementation split

Re: [I] [C++][Python] hash_pivot_wider crashes if called without options [arrow]

2025-03-11 Thread via GitHub
pitrou closed issue #45739: [C++][Python] hash_pivot_wider crashes if called without options URL: https://github.com/apache/arrow/issues/45739

[I] [C++] Thread pool performance behaves counter-intuitive for I/O intensive operations (e.g. file system) [arrow]

2025-03-11 Thread via GitHub
OliLay opened a new issue, #45749: URL: https://github.com/apache/arrow/issues/45749 ### Describe the bug, including details regarding any error messages, version, and platform. Hey guys, we noticed something with using S3/Azure Blob and the arrow file system implementation. W

[I] [C++] Remove deprecated GetNextSegment [arrow]

2025-03-11 Thread via GitHub
AlenkaF opened a new issue, #45744: URL: https://github.com/apache/arrow/issues/45744 ### Describe the enhancement requested Remove `GetNextSegment` and related from compute/row/grouper.cc, the code has been deprecated in 18.0.0. See: https://github.com/apache/arrow/pull/44053

[I] [C++] Remove deprecated functions in 20.0 (C++) [arrow]

2025-03-11 Thread via GitHub
AlenkaF opened a new issue, #45746: URL: https://github.com/apache/arrow/issues/45746 ### Describe the enhancement requested Remove subset of deprecated functions listed in #45688: - `GetCapacity` in Gandiva cache.cc from [GH-41329](https://github.com/apache/arrow/pull/41330 )(

Re: [I] [C#] C# builds are failing on MacOS [arrow]

2025-03-11 Thread via GitHub
CurtHagenlocher closed issue #45656: [C#] C# builds are failing on MacOS URL: https://github.com/apache/arrow/issues/45656

Re: [I] csharp: builds are failing on MacOS [arrow-adbc]

2025-03-11 Thread via GitHub
CurtHagenlocher closed issue #2568: csharp: builds are failing on MacOS URL: https://github.com/apache/arrow-adbc/issues/2568

Re: [I] [Python][FlightRPC] Modify 'user-agent' header from client using ClientMiddleware [arrow]

2025-03-11 Thread via GitHub
xshirax closed issue #45730: [Python][FlightRPC] Modify 'user-agent' header from client using ClientMiddleware URL: https://github.com/apache/arrow/issues/45730

Re: [I] csharp: ValueAt extension causes error when StringArray length = 0 [arrow-adbc]

2025-03-11 Thread via GitHub
CurtHagenlocher closed issue #2586: csharp: ValueAt extension causes error when StringArray length = 0 URL: https://github.com/apache/arrow-adbc/issues/2586

[I] [C++][Compute] Hash aggregation is slowish [arrow]

2025-03-11 Thread via GitHub
pitrou opened a new issue, #45741: URL: https://github.com/apache/arrow/issues/45741 ### Describe the enhancement requested Running some simple benchmarks from Python, I was a bit surprised by the performance of group-by aggregations: * 1000 groups: ```pycon >>> n = 1

[I] [C++][Python] hash_pivot_wider crashes if called without options [arrow]

2025-03-11 Thread via GitHub
pitrou opened a new issue, #45739: URL: https://github.com/apache/arrow/issues/45739 ### Describe the bug, including details regarding any error messages, version, and platform. ```pycon >>> a = pa.table({'group': [1,2,3,1,2,3], 'key': ['h', 'h', 'h', 'w', 'w', 'w'], 'value': [10,

Re: [I] [C++][Compute] Split up `hash_aggregate.cc` [arrow]

2025-03-11 Thread via GitHub
pitrou closed issue #45563: [C++][Compute] Split up `hash_aggregate.cc` URL: https://github.com/apache/arrow/issues/45563

[I] [Archery] test-conda-python-3.13 failing due to setuptools hash [arrow]

2025-03-11 Thread via GitHub
AlenkaF opened a new issue, #45737: URL: https://github.com/apache/arrow/issues/45737 ### Describe the bug, including details regarding any error messages, version, and platform. The [test-conda-python-3.13](https://github.com/ursacomputing/crossbow/actions/runs/13777681150/job/38530

Re: [I] [Docs] Fix outdated `ubuntu-docs` docker image name to `debian-docs` [arrow]

2025-03-11 Thread via GitHub
kou closed issue #45724: [Docs] Fix outdated `ubuntu-docs` docker image name to `debian-docs` URL: https://github.com/apache/arrow/issues/45724

Re: [I] [C++][Acero] Missing check for vector size mismatch between names and expressions in ProjectNode [arrow]

2025-03-11 Thread via GitHub
zanmato1984 closed issue #45564: [C++][Acero] Missing check for vector size mismatch between names and expressions in ProjectNode URL: https://github.com/apache/arrow/issues/45564