[GitHub] [arrow-adbc] lidavidm closed issue #455: [Go] Error building go driver manager (no C++ standard set)

2023-03-02 Thread via GitHub
lidavidm closed issue #455: [Go] Error building go driver manager (no C++ standard set) URL: https://github.com/apache/arrow-adbc/issues/455 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

[GitHub] [arrow] JonnyWaffles opened a new issue, #34414: slow s3 parquet reads when using fsspec S3FileSystem

2023-03-02 Thread via GitHub
JonnyWaffles opened a new issue, #34414: URL: https://github.com/apache/arrow/issues/34414 ### Describe the bug, including details regarding any error messages, version, and platform. Hi team, I hope you are well. Apologies if this is documented somewhere, I searched around but could

[GitHub] [arrow] no23reason opened a new issue, #34415: [Python]: Trying to read metadata without calling done_writing hangs both client and server

2023-03-02 Thread via GitHub
no23reason opened a new issue, #34415: URL: https://github.com/apache/arrow/issues/34415 ### Describe the bug, including details regarding any error messages, version, and platform. When implementing a custom `do_put` operation using `FlightServerBase`, if the client does not call `d

[GitHub] [arrow-adbc] lidavidm closed issue #469: [Go] Flight SQL: GetInfo shouldn't fail if GetSqlInfo isn't implemented

2023-03-02 Thread via GitHub
lidavidm closed issue #469: [Go] Flight SQL: GetInfo shouldn't fail if GetSqlInfo isn't implemented URL: https://github.com/apache/arrow-adbc/issues/469

[GitHub] [arrow] assignUser closed issue #34414: slow s3 parquet reads when using fsspec S3FileSystem

2023-03-02 Thread via GitHub
assignUser closed issue #34414: slow s3 parquet reads when using fsspec S3FileSystem URL: https://github.com/apache/arrow/issues/34414

[GitHub] [arrow] no23reason closed issue #34415: [Python]: Trying to read metadata without calling done_writing hangs both client and server thread

2023-03-02 Thread via GitHub
no23reason closed issue #34415: [Python]: Trying to read metadata without calling done_writing hangs both client and server thread URL: https://github.com/apache/arrow/issues/34415

[GitHub] [arrow] wjones127 opened a new issue, #34417: [C++][Flight] Remove deprecated OpenTelemetry header

2023-03-02 Thread via GitHub
wjones127 opened a new issue, #34417: URL: https://github.com/apache/arrow/issues/34417 ### Describe the enhancement requested This header was deprecated and was just removed in opentelemetry-cpp 1.8.2 https://github.com/open-telemetry/opentelemetry-cpp/blob/main/CHANGELOG.md#1

[GitHub] [arrow] kou closed issue #34256: [Release] Update release scripts with main as new default branch

2023-03-02 Thread via GitHub
kou closed issue #34256: [Release] Update release scripts with main as new default branch URL: https://github.com/apache/arrow/issues/34256

[GitHub] [arrow-adbc] lidavidm closed issue #481: [CI] arrow-c-glib/red-arrow version needs to be pinned

2023-03-02 Thread via GitHub
lidavidm closed issue #481: [CI] arrow-c-glib/red-arrow version needs to be pinned URL: https://github.com/apache/arrow-adbc/issues/481

[GitHub] [arrow] amoeba opened a new issue, #34421: [R] Allow GcsFileSystem$create to accept a path to json_credentials

2023-03-02 Thread via GitHub
amoeba opened a new issue, #34421: URL: https://github.com/apache/arrow/issues/34421 ### Describe the enhancement requested As discovered in https://github.com/apache/arrow/issues/33106, `GcsFileSystem$create` is documented to say it takes a path to a set of JSON credentials on disk.

[GitHub] [arrow] amoeba opened a new issue, #34422: [R] Expose GcsFileSystem options in R

2023-03-02 Thread via GitHub
amoeba opened a new issue, #34422: URL: https://github.com/apache/arrow/issues/34422 ### Describe the enhancement requested Related to https://github.com/apache/arrow/issues/34421 and commented as a TODO: https://github.com/apache/arrow/blob/50fb5b0df84a3ccaa665965e36a6e757d1a

[GitHub] [arrow] cwang9208 opened a new issue, #34423: pyarrow MemoryMappedFile close does not release memory

2023-03-02 Thread via GitHub
cwang9208 opened a new issue, #34423: URL: https://github.com/apache/arrow/issues/34423 ### Describe the usage question you have. Please include as many useful details as possible. `path = "" files = os.listdir(path) expr = pc.field("l_shipdate") <= datetime.date(1998, 12,

[GitHub] [arrow] swyatt7 closed issue #34403: [Python] Is there a way to construct the metadata_collector for an existing partitioned dataset?

2023-03-02 Thread via GitHub
swyatt7 closed issue #34403: [Python] Is there a way to construct the metadata_collector for an existing partitioned dataset? URL: https://github.com/apache/arrow/issues/34403

[GitHub] [arrow] heronshoes opened a new issue, #34425: [GLib] Add support for RankOptions

2023-03-03 Thread via GitHub
heronshoes opened a new issue, #34425: URL: https://github.com/apache/arrow/issues/34425 ### Describe the enhancement requested Target option RankOptions Proposed feature Add support for RankOptions. ##

[GitHub] [arrow] heronshoes opened a new issue, #34426: [C++, GLib, Ruby] Compute function 'rank' does not support ChunkedArray

2023-03-03 Thread via GitHub
heronshoes opened a new issue, #34426: URL: https://github.com/apache/arrow/issues/34426 ### Describe the bug, including details regarding any error messages, version, and platform. The issue `rank` from C++ compute function does not support ChunkedArray in Ruby. ``

[GitHub] [arrow] raulcd opened a new issue, #34427: [CI] Nightly report emails and Zulip messages are failing to be sent due to SMTP account failure on crossbow

2023-03-03 Thread via GitHub
raulcd opened a new issue, #34427: URL: https://github.com/apache/arrow/issues/34427 ### Describe the bug, including details regarding any error messages, version, and platform. As reported on Zulip, since the 27th of February we have stopped receiving emails and Zulip notifications for ni

[GitHub] [arrow] Fokko opened a new issue, #34428: [Python] Add missing docs on `make_fragement`

2023-03-03 Thread via GitHub
Fokko opened a new issue, #34428: URL: https://github.com/apache/arrow/issues/34428 ### Describe the enhancement requested I had to go to the code to see what we could pass into the `make_fragments` ### Component(s) Python

[GitHub] [arrow] no23reason opened a new issue, #34431: [Python] Expose gRPC cancel to FlightStreamWriter

2023-03-03 Thread via GitHub
no23reason opened a new issue, #34431: URL: https://github.com/apache/arrow/issues/34431 ### Describe the enhancement requested ### Use case In certain situations, client code using the FlightClient to perform the DoPut operation might need to cancel half way through writing (f

[GitHub] [arrow] wgtmac opened a new issue, #34432: [C++][Java][IPC] Java reader cannot read compressed file created by C++ writer

2023-03-03 Thread via GitHub
wgtmac opened a new issue, #34432: URL: https://github.com/apache/arrow/issues/34432 ### Describe the bug, including details regarding any error messages, version, and platform. To reproduce the issue, use the C++ code below to write and read arrow IPC file. I use the 11.0.0 version.

[GitHub] [arrow] kylebarron opened a new issue, #34433: [Python]: Possible to evaluate `pyarrow.compute.Expression` without filter?

2023-03-03 Thread via GitHub
kylebarron opened a new issue, #34433: URL: https://github.com/apache/arrow/issues/34433 ### Describe the usage question you have. Please include as many useful details as possible. I'm unsure whether this should be categorized as "usage" or "enhancement". I've read throu

[GitHub] [arrow] kmkramer23 opened a new issue, #34434: read_csv_arrow stops reading the rest of the file during a conversion error

2023-03-03 Thread via GitHub
kmkramer23 opened a new issue, #34434: URL: https://github.com/apache/arrow/issues/34434 ### Describe the usage question you have. Please include as many useful details as possible. When reading a csv file in R using read_csv_arrow with a schema, the program will stop and not fi

[GitHub] [arrow] westonpace closed issue #34326: [C++][Parquet] Page null_count is incorrect if stats is disabled

2023-03-03 Thread via GitHub
westonpace closed issue #34326: [C++][Parquet] Page null_count is incorrect if stats is disabled URL: https://github.com/apache/arrow/issues/34326

[GitHub] [arrow] nealrichardson opened a new issue, #34436: [R] Bindings for JSON Dataset

2023-03-03 Thread via GitHub
nealrichardson opened a new issue, #34436: URL: https://github.com/apache/arrow/issues/34436 ### Describe the enhancement requested See #33732 where it was added to the C++ library ### Component(s) R

[GitHub] [arrow] nealrichardson opened a new issue, #34437: [R] Use FetchNode

2023-03-03 Thread via GitHub
nealrichardson opened a new issue, #34437: URL: https://github.com/apache/arrow/issues/34437 ### Describe the enhancement requested See #34059. There's at least [one workaround](https://github.com/apache/arrow/blob/main/r/R/query-engine.R#L224-L233) we can remove and push the work in

[GitHub] [arrow] wjones127 opened a new issue, #34438: [C++] Printing unicode characters can be confusing

2023-03-03 Thread via GitHub
wjones127 opened a new issue, #34438: URL: https://github.com/apache/arrow/issues/34438 ### Describe the enhancement requested The way that C++ prints out some characters can be confusing: ``` import pyarrow as pa arr = pa.array([" ", "\u0080", "\u0081", "\u0082"])
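A minimal sketch of why values like these are hard to tell apart when printed. It uses only the Python standard library, not pyarrow, so it says nothing about how Arrow's C++ printer actually behaves. U+0080 to U+0082 are C1 control characters with no visible glyph, while an escaped rendering keeps them distinguishable. The helper name `describe` is hypothetical.

```python
import unicodedata


def describe(value: str) -> str:
    """Return an escaped rendering of a string (hypothetical helper)."""
    # unicode_escape escapes control and non-ASCII characters; printable
    # ASCII other than the backslash passes through unchanged.
    return value.encode("unicode_escape").decode("ascii")


values = [" ", "\u0080", "\u0081", "\u0082"]
for v in values:
    # Category "Cc" marks a control character; "Zs" marks a space separator.
    print(unicodedata.category(v), v.isprintable(), describe(v))
```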

[GitHub] [arrow] wjones127 closed issue #34417: [C++][Flight] Remove deprecated OpenTelemetry header

2023-03-03 Thread via GitHub
wjones127 closed issue #34417: [C++][Flight] Remove deprecated OpenTelemetry header URL: https://github.com/apache/arrow/issues/34417

[GitHub] [arrow] kou opened a new issue, #34440: [Ruby] `each` without block support is missing for `RecordBatch{File,Stream}Reader`

2023-03-03 Thread via GitHub
kou opened a new issue, #34440: URL: https://github.com/apache/arrow/issues/34440 ### Describe the enhancement requested The following don't work: ```ruby reader = Arrow::RecordBatchFileReader.new(input) reader.each.to_a ``` ```ruby reader = Arrow::RecordBatc

[GitHub] [arrow] kou opened a new issue, #34442: [Ruby][FlightRPC] Add `ArrowFlight::StreamReader#each`

2023-03-03 Thread via GitHub
kou opened a new issue, #34442: URL: https://github.com/apache/arrow/issues/34442 ### Describe the enhancement requested All iterable objects should have `#each` in Ruby. ### Component(s) FlightRPC, Ruby

[GitHub] [arrow] chutchinson opened a new issue, #34443: RecordBatch.Builder does not respect field metadata

2023-03-03 Thread via GitHub
chutchinson opened a new issue, #34443: URL: https://github.com/apache/arrow/issues/34443 ### Describe the bug, including details regarding any error messages, version, and platform. The **RecordBatch.Builder** API does not respect field metadata when calling **Append**. The field an

[GitHub] [arrow] chutchinson closed issue #34443: [C#] RecordBatch.Builder does not respect field metadata

2023-03-03 Thread via GitHub
chutchinson closed issue #34443: [C#] RecordBatch.Builder does not respect field metadata URL: https://github.com/apache/arrow/issues/34443

[GitHub] [arrow] zeroshade closed issue #33466: [GO]: pqarrow (github.com/apache/arrow/go/v9/parquet/pqarrow) cannot handle arrow's DICTIONARY field

2023-03-03 Thread via GitHub
zeroshade closed issue #33466: [GO]: pqarrow (github.com/apache/arrow/go/v9/parquet/pqarrow) cannot handle arrow's DICTIONARY field URL: https://github.com/apache/arrow/issues/33466

[GitHub] [arrow] wjones127 opened a new issue, #34446: [C++] Failing RecordReaderPrimitveTypeTests

2023-03-03 Thread via GitHub
wjones127 opened a new issue, #34446: URL: https://github.com/apache/arrow/issues/34446 ### Describe the bug, including details regarding any error messages, version, and platform. See: https://github.com/apache/arrow/actions/runs/4326661366/jobs/7554348864 From #17877 #

[GitHub] [arrow] rok closed issue #34210: [C++] Make casting timestamp and duration zero-copy when TimeUnit matches

2023-03-03 Thread via GitHub
rok closed issue #34210: [C++] Make casting timestamp and duration zero-copy when TimeUnit matches URL: https://github.com/apache/arrow/issues/34210

[GitHub] [arrow] assignUser closed issue #34446: [C++] Failing RecordReaderPrimitveTypeTests

2023-03-03 Thread via GitHub
assignUser closed issue #34446: [C++] Failing RecordReaderPrimitveTypeTests URL: https://github.com/apache/arrow/issues/34446

[GitHub] [arrow] westonpace closed issue #34076: [C#] Exception thrown when creating schema with multiple columns with the same name

2023-03-03 Thread via GitHub
westonpace closed issue #34076: [C#] Exception thrown when creating schema with multiple columns with the same name URL: https://github.com/apache/arrow/issues/34076

[GitHub] [arrow] j-bennet opened a new issue, #34449: [Python] `to_parquet` fails with a category field backed by pyarrow string

2023-03-03 Thread via GitHub
j-bennet opened a new issue, #34449: URL: https://github.com/apache/arrow/issues/34449 ### Describe the bug, including details regarding any error messages, version, and platform. Originally filed in pandas-dev, but was told that the issue is in pyarrow: * https://github.com/pa

[GitHub] [arrow] westonpace closed issue #34388: [C++] Build core compute kernels unconditionally

2023-03-03 Thread via GitHub
westonpace closed issue #34388: [C++] Build core compute kernels unconditionally URL: https://github.com/apache/arrow/issues/34388

[GitHub] [arrow] coady opened a new issue, #34451: [C++][Python] A metadata standard for sorted datasets.

2023-03-03 Thread via GitHub
coady opened a new issue, #34451: URL: https://github.com/apache/arrow/issues/34451 ### Describe the enhancement requested Split off from #34153. In order to take advantage of sorted columns, it would be necessary for arrow to standardize on a way to represent sorting in datase

[GitHub] [arrow] westonpace closed issue #34266: [C++] Add a pivot_longer node

2023-03-03 Thread via GitHub
westonpace closed issue #34266: [C++] Add a pivot_longer node URL: https://github.com/apache/arrow/issues/34266

[GitHub] [arrow] lichangqing527 opened a new issue, #34452: There is a problem with building arrow with maven

2023-03-04 Thread via GitHub
lichangqing527 opened a new issue, #34452: URL: https://github.com/apache/arrow/issues/34452 ### Describe the usage question you have. Please include as many useful details as possible. The system is Ubuntu. When I execute the following command: ` mvn generate-resources

[GitHub] [arrow] Kesanov closed issue #34274: [Python] High RAM usage when reading parquet.

2023-03-04 Thread via GitHub
Kesanov closed issue #34274: [Python] High RAM usage when reading parquet. URL: https://github.com/apache/arrow/issues/34274

[GitHub] [arrow] yevgenypats opened a new issue, #34453: [GO] Extension Builder Interface

2023-03-04 Thread via GitHub
yevgenypats opened a new issue, #34453: URL: https://github.com/apache/arrow/issues/34453 ### Describe the enhancement requested (This was also sent to the mailing list and discussed briefly with @zeroshade.) Copy of what was sent to the mailing list, and a PR will quickly follow:

[GitHub] [arrow] plamb-viso opened a new issue, #34455: ArrowNotImplementedError: concatenation of extension>

2023-03-04 Thread via GitHub
plamb-viso opened a new issue, #34455: URL: https://github.com/apache/arrow/issues/34455 ### Describe the usage question you have. Please include as many useful details as possible. I'm using Huggingface Datasets to encode a dataset which uses pyarrow under the covers. The mappe

[GitHub] [arrow] ankane opened a new issue, #34457: [Ruby] Ruby library (silently) installs system libraries

2023-03-04 Thread via GitHub
ankane opened a new issue, #34457: URL: https://github.com/apache/arrow/issues/34457 ### Describe the bug, including details regarding any error messages, version, and platform. Hi, the `red-arrow` gem and its `gobject-introspection` dependency currently install a number of system li

[GitHub] [arrow] kou closed issue #34442: [Ruby][FlightRPC] Add `ArrowFlight::RecordBatchReader#each`

2023-03-04 Thread via GitHub
kou closed issue #34442: [Ruby][FlightRPC] Add `ArrowFlight::RecordBatchReader#each` URL: https://github.com/apache/arrow/issues/34442

[GitHub] [arrow] kou closed issue #34440: [Ruby] `each` without block support is missing for `RecordBatch{File,Stream}Reader`

2023-03-04 Thread via GitHub
kou closed issue #34440: [Ruby] `each` without block support is missing for `RecordBatch{File,Stream}Reader` URL: https://github.com/apache/arrow/issues/34440

[GitHub] [arrow] Bramtimm opened a new issue, #34459: Segmentation fault when trying to connect to AWS S3 Storage on CentOS 7 – Amazon Linux 2

2023-03-05 Thread via GitHub
Bramtimm opened a new issue, #34459: URL: https://github.com/apache/arrow/issues/34459 ### Describe the bug, including details regarding any error messages, version, and platform. Hi there, Since Arrow 11.0.2 we run into a segmentation fault when trying to connect to our A

[GitHub] [arrow] Hor911 opened a new issue, #34460: [C++] Split arrow::FileReader::ReadRowGroups() to 2 methods for flexible async IO

2023-03-05 Thread via GitHub
Hor911 opened a new issue, #34460: URL: https://github.com/apache/arrow/issues/34460 ### Describe the enhancement requested Current implementation of arrow::FileReader::ReadRowGroups() does 2 things: 1. Triggers pre-buffering if enabled 2. Blocks until all data is received an
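The general idea of splitting one blocking call into a "start the I/O" step and a "wait for the result" step can be sketched as follows. This is only an illustration in Python under stated assumptions, not the C++ API the issue proposes: `TwoPhaseReader`, `begin_read`, `finish_read`, and `fetch_row_group` are all hypothetical names, and the sleep merely stands in for slow I/O.

```python
import time
from concurrent.futures import Future, ThreadPoolExecutor
from typing import List


def fetch_row_group(index: int) -> str:
    """Stand-in for slow I/O (hypothetical; not an Arrow call)."""
    time.sleep(0.01)
    return f"row-group-{index}"


class TwoPhaseReader:
    """Hypothetical reader that separates starting I/O from waiting on it."""

    def __init__(self) -> None:
        self._pool = ThreadPoolExecutor(max_workers=4)

    def begin_read(self, row_groups: List[int]) -> List[Future]:
        # Phase 1: schedule the fetches and return immediately.
        return [self._pool.submit(fetch_row_group, i) for i in row_groups]

    @staticmethod
    def finish_read(pending: List[Future]) -> List[str]:
        # Phase 2: block until every scheduled fetch has completed.
        return [f.result() for f in pending]

    def close(self) -> None:
        self._pool.shutdown(wait=True)


reader = TwoPhaseReader()
pending = reader.begin_read([0, 1, 2])
# The caller is free to do unrelated work here while the fetches run.
results = reader.finish_read(pending)
reader.close()
print(results)
```

Keeping the two phases separate lets a caller overlap its own work with the outstanding reads instead of being blocked inside a single call.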

[GitHub] [arrow] kou closed issue #34425: [GLib] Add support for RankOptions

2023-03-05 Thread via GitHub
kou closed issue #34425: [GLib] Add support for RankOptions URL: https://github.com/apache/arrow/issues/34425

[GitHub] [arrow] lichangqing527 closed issue #34452: [Java] There is a problem with building arrow with maven

2023-03-05 Thread via GitHub
lichangqing527 closed issue #34452: [Java] There is a problem with building arrow with maven URL: https://github.com/apache/arrow/issues/34452

[GitHub] [arrow] AlenkaF closed issue #34449: [Python] `to_parquet` fails with a category field backed by pyarrow string

2023-03-06 Thread via GitHub
AlenkaF closed issue #34449: [Python] `to_parquet` fails with a category field backed by pyarrow string URL: https://github.com/apache/arrow/issues/34449

[GitHub] [arrow] thisisnic closed issue #34398: [R] Update NEWS.md for 11.0.0.3

2023-03-06 Thread via GitHub
thisisnic closed issue #34398: [R] Update NEWS.md for 11.0.0.3 URL: https://github.com/apache/arrow/issues/34398

[GitHub] [arrow] lukemanley opened a new issue, #34462: Support non-nanosecond units in method to_pandas_dtype for timestamp and duration types

2023-03-06 Thread via GitHub
lukemanley opened a new issue, #34462: URL: https://github.com/apache/arrow/issues/34462 ### Describe the enhancement requested Pyarrow `timestamp(unit).to_pandas_dtype()` and `duration(unit).to_pandas_dtype` are currently returning `dtype('>> import pyarrow as pa >>> pa.timest

[GitHub] [arrow] thisisnic opened a new issue, #34464: [R] Missing rlang import - inform

2023-03-06 Thread via GitHub
thisisnic opened a new issue, #34464: URL: https://github.com/apache/arrow/issues/34464 ### Describe the bug, including details regarding any error messages, version, and platform. Checks fail as `rlang::inform()` should be added to the package's NAMESPACE ``` ── Error ('tes

[GitHub] [arrow] thisisnic closed issue #34464: [R] Missing rlang import - inform

2023-03-06 Thread via GitHub
thisisnic closed issue #34464: [R] Missing rlang import - inform URL: https://github.com/apache/arrow/issues/34464

[GitHub] [arrow] thisisnic opened a new issue, #34467: [R] Disable DuckDB tests on R versions < 4.0.0

2023-03-06 Thread via GitHub
thisisnic opened a new issue, #34467: URL: https://github.com/apache/arrow/issues/34467 ### Describe the bug, including details regarding any error messages, version, and platform. DuckDB tests are failing for R < 4.0.0 due to a newly released version of DuckDB which uses a function

[GitHub] [arrow] wjones127 closed issue #28737: [C++][Parquet] Decoding byte stream split encoded columns fails when it has null values

2023-03-06 Thread via GitHub
wjones127 closed issue #28737: [C++][Parquet] Decoding byte stream split encoded columns fails when it has null values URL: https://github.com/apache/arrow/issues/28737

[GitHub] [arrow] wjones127 closed issue #29912: [Python] Add support for the SSO credential provider

2023-03-06 Thread via GitHub
wjones127 closed issue #29912: [Python] Add support for the SSO credential provider URL: https://github.com/apache/arrow/issues/29912

[GitHub] [arrow-adbc] lidavidm opened a new issue, #491: go/driver/flightsql: append ADBC/Flight to user-agent

2023-03-06 Thread via GitHub
lidavidm opened a new issue, #491: URL: https://github.com/apache/arrow-adbc/issues/491 (Also, upstream should put Flight/Flight SQL in the user agent.)

[GitHub] [arrow-adbc] lidavidm opened a new issue, #492: go/adbc/driver/flightsql: add extra logging

2023-03-06 Thread via GitHub
lidavidm opened a new issue, #492: URL: https://github.com/apache/arrow-adbc/issues/492 grpc-go doesn't log outgoing headers - ideally we should have a way to enable logging of headers + Flight/Flight SQL operation for debugging

[GitHub] [arrow] lidavidm opened a new issue, #34472: [Go] Flight SQL client does not drain DoAction streams

2023-03-06 Thread via GitHub
lidavidm opened a new issue, #34472: URL: https://github.com/apache/arrow/issues/34472 ### Describe the bug, including details regarding any error messages, version, and platform. This is similar to a bug that was in the C++ Flight SQL client. This means that certain operations won't

[GitHub] [arrow-adbc] lidavidm opened a new issue, #493: go/adbc/driver/flightsql: close() on a prepared statement does not send headers

2023-03-06 Thread via GitHub
lidavidm opened a new issue, #493: URL: https://github.com/apache/arrow-adbc/issues/493 Due to https://github.com/apache/arrow/issues/34472 + a bug in this driver

[GitHub] [arrow-adbc] lidavidm closed issue #482: [Go] driver/flightsql: providing any timeout causes everything to timeout instantly

2023-03-06 Thread via GitHub
lidavidm closed issue #482: [Go] driver/flightsql: providing any timeout causes everything to timeout instantly URL: https://github.com/apache/arrow-adbc/issues/482

[GitHub] [arrow] westonpace closed issue #34262: [C++] Support for union type in ORC writer

2023-03-06 Thread via GitHub
westonpace closed issue #34262: [C++] Support for union type in ORC writer URL: https://github.com/apache/arrow/issues/34262

[GitHub] [arrow] ericlin4 opened a new issue, #34474: Table.join() produces incorrect results for large inputs

2023-03-06 Thread via GitHub
ericlin4 opened a new issue, #34474: URL: https://github.com/apache/arrow/issues/34474 ### Describe the bug, including details regarding any error messages, version, and platform. Pyarrow's join does not produce the same results as Pandas when the input tables are large. I am observi

[GitHub] [arrow] rtpsw opened a new issue, #34475: [C++] Optimize ordered aggregation

2023-03-06 Thread via GitHub
rtpsw opened a new issue, #34475: URL: https://github.com/apache/arrow/issues/34475 ### Describe the enhancement requested Optimize ordered aggregation to take advantage of (and enforce) ordering. This is a follow-up on [this discussion](https://github.com/apache/arrow/pull/34311#dis

[GitHub] [arrow] wjones127 closed issue #34410: [Python][C++] No longer possible to specify higher chunksize than the default for Parquet writing

2023-03-06 Thread via GitHub
wjones127 closed issue #34410: [Python][C++] No longer possible to specify higher chunksize than the default for Parquet writing URL: https://github.com/apache/arrow/issues/34410

[GitHub] [arrow-ballista-python] BubbaJoe opened a new issue, #27: module 'ballista' has no attribute 'BallistaContext'

2023-03-06 Thread via GitHub
BubbaJoe opened a new issue, #27: URL: https://github.com/apache/arrow-ballista-python/issues/27 **Describe the bug** I am trying to connect to ballista cluster, but documentation or something must be wrong. https://arrow.apache.org/ballista/user-guide/python.html **To Repro

[GitHub] [arrow] ankane closed issue #34457: [Ruby] Ruby library (silently) installs system libraries

2023-03-06 Thread via GitHub
ankane closed issue #34457: [Ruby] Ruby library (silently) installs system libraries URL: https://github.com/apache/arrow/issues/34457

[GitHub] [arrow] engimatic closed issue #34021: [Java] BufferAllocator allocated memory always increase

2023-03-06 Thread via GitHub
engimatic closed issue #34021: [Java] BufferAllocator allocated memory always increase URL: https://github.com/apache/arrow/issues/34021

[GitHub] [arrow] cwang9208 opened a new issue, #34478: Flight RPC throughput drops when transfer large messages

2023-03-06 Thread via GitHub
cwang9208 opened a new issue, #34478: URL: https://github.com/apache/arrow/issues/34478 ### Describe the usage question you have. Please include as many useful details as possible. I'm benchmarking the performance of flight RPC using the built-in C++ benchmark server/client. Bot

[GitHub] [arrow] cwang9208 closed issue #34478: [C++][FlightRPC] Flight RPC throughput drops when transfer large messages

2023-03-06 Thread via GitHub
cwang9208 closed issue #34478: [C++][FlightRPC] Flight RPC throughput drops when transfer large messages URL: https://github.com/apache/arrow/issues/34478

[GitHub] [arrow] js8544 opened a new issue, #34479: [Java] java-jars failing due to conflicting slf4j bindings

2023-03-07 Thread via GitHub
js8544 opened a new issue, #34479: URL: https://github.com/apache/arrow/issues/34479 ### Describe the bug, including details regarding any error messages, version, and platform. Example failure build: https://github.com/ursacomputing/crossbow/actions/runs/4341450290/jobs/7582202736

[GitHub] [arrow] raulcd opened a new issue, #34481: [CI] Migrate ARM jobs from Travis to new self-hosted runners

2023-03-07 Thread via GitHub
raulcd opened a new issue, #34481: URL: https://github.com/apache/arrow/issues/34481 ### Describe the enhancement requested As per https://github.com/apache/arrow/issues/20496 we have to migrate our existing Travis CI jobs from the Apache repository. On this issue I plan to cover t

[GitHub] [arrow] lidavidm closed issue #34479: [Java] java-jars failing due to conflicting slf4j bindings

2023-03-07 Thread via GitHub
lidavidm closed issue #34479: [Java] java-jars failing due to conflicting slf4j bindings URL: https://github.com/apache/arrow/issues/34479

[GitHub] [arrow] paleolimbot opened a new issue, #34484: [C++] Substrait join results in all zeroes on the righthand side of the join

2023-03-07 Thread via GitHub
paleolimbot opened a new issue, #34484: URL: https://github.com/apache/arrow/issues/34484 ### Describe the bug, including details regarding any error messages, version, and platform. Very possible that there's something wrong with my plan here! Reproducer via the R bindings: `

[GitHub] [arrow] lidavidm opened a new issue, #34485: [Format][FlightRPC] Transfer FlightData in pieces

2023-03-07 Thread via GitHub
lidavidm opened a new issue, #34485: URL: https://github.com/apache/arrow/issues/34485 ### Describe the enhancement requested gRPC presents a message-based interface (not a stream-based interface like HTTP). It also limits the size of individual messages by default. For Flight, this

[GitHub] [arrow] hinxx opened a new issue, #34486: Converting arrow sparse_union table to pandas dataframe

2023-03-07 Thread via GitHub
hinxx opened a new issue, #34486: URL: https://github.com/apache/arrow/issues/34486 ### Describe the bug, including details regarding any error messages, version, and platform. I can not seem to convert pyarrow sparse_union table to pandas dataframe: ``` >>> table pyarrow

[GitHub] [arrow] r2evans opened a new issue, #34487: memory allocation crash

2023-03-07 Thread via GitHub
r2evans opened a new issue, #34487: URL: https://github.com/apache/arrow/issues/34487 ### Describe the bug, including details regarding any error messages, version, and platform. Motived by https://stackoverflow.com/questions/75657380/readr-vs-data-table-different-results-on-fedora,

[GitHub] [arrow] westonpace closed issue #34428: [Python] Add missing docs on `make_fragement`

2023-03-07 Thread via GitHub
westonpace closed issue #34428: [Python] Add missing docs on `make_fragement` URL: https://github.com/apache/arrow/issues/34428

[GitHub] [arrow] zeroshade closed issue #34385: [Go] Read IPC files with compression enabled but uncompressed buffers

2023-03-07 Thread via GitHub
zeroshade closed issue #34385: [Go] Read IPC files with compression enabled but uncompressed buffers URL: https://github.com/apache/arrow/issues/34385

[GitHub] [arrow] assignUser closed issue #34481: [CI] Migrate ARM jobs from Travis to new self-hosted runners

2023-03-07 Thread via GitHub
assignUser closed issue #34481: [CI] Migrate ARM jobs from Travis to new self-hosted runners URL: https://github.com/apache/arrow/issues/34481

[GitHub] [arrow] lidavidm closed issue #34472: [Go][FlightRPC] Flight SQL client does not drain DoAction streams

2023-03-07 Thread via GitHub
lidavidm closed issue #34472: [Go][FlightRPC] Flight SQL client does not drain DoAction streams URL: https://github.com/apache/arrow/issues/34472

[GitHub] [arrow] alippai opened a new issue, #34491: [Python] Add pyarrow.TableGroupBy() subtables

2023-03-07 Thread via GitHub
alippai opened a new issue, #34491: URL: https://github.com/apache/arrow/issues/34491 ### Describe the enhancement requested ```python import pyarrow as pa t = pa.table([ pa.array(["a", "a", "b", "b", "c"]), pa.array([1, 2, 3, 4, 5]), ], names=["keys", "value

[GitHub] [arrow] aaron-mongodb opened a new issue, #34492: PlainBooleanEncoder occasionally emits incorrect data over page boundaries

2023-03-07 Thread via GitHub
aaron-mongodb opened a new issue, #34492: URL: https://github.com/apache/arrow/issues/34492 ### Describe the bug, including details regarding any error messages, version, and platform. The `PlainBooleanEncoder` is used in a way indicating that `FlushValues` "resets" its state. Howeve

[GitHub] [arrow-julia] palday opened a new issue, #391: Precompilation broken on Julia 1.9-rc1

2023-03-07 Thread via GitHub
palday opened a new issue, #391: URL: https://github.com/apache/arrow-julia/issues/391 The lines impacted are https://github.com/apache/arrow-julia/blob/63d2c9d3ca4539a0ea831ae8ecafa71b051d475d/src/ArrowTypes/src/ArrowTypes.jl#L337-L339 ```julia (@v1.9) pkg> activate --temp

[GitHub] [arrow] westonpace closed issue #34405: [C++] Make it possible to specify custom field names in QueryOptions

2023-03-07 Thread via GitHub
westonpace closed issue #34405: [C++] Make it possible to specify custom field names in QueryOptions URL: https://github.com/apache/arrow/issues/34405

[GitHub] [arrow] westonpace closed issue #33434: [C++] Add Acero test to ensure names in the root relation of Substrain plans are retained

2023-03-07 Thread via GitHub
westonpace closed issue #33434: [C++] Add Acero test to ensure names in the root relation of Substrain plans are retained URL: https://github.com/apache/arrow/issues/33434

[GitHub] [arrow] wjones127 closed issue #34322: [C++][Parquet] Add benchmarks for micro benchmark for DELTA_LENGTH_BYTE_ARRAY

2023-03-07 Thread via GitHub
wjones127 closed issue #34322: [C++][Parquet] Add benchmarks for micro benchmark for DELTA_LENGTH_BYTE_ARRAY URL: https://github.com/apache/arrow/issues/34322

[GitHub] [arrow] westonpace closed issue #33341: [C++] arrow::UInt64Builder::Reset() doesn't affect the builder's length()

2023-03-07 Thread via GitHub
westonpace closed issue #33341: [C++] arrow::UInt64Builder::Reset() doesn't affect the builder's length() URL: https://github.com/apache/arrow/issues/33341

[GitHub] [arrow-adbc] david-waterworth opened a new issue, #496: Unsupported data types for python

2023-03-07 Thread via GitHub
david-waterworth opened a new issue, #496: URL: https://github.com/apache/arrow-adbc/issues/496 Is there a list of what postgres datatypes are/are not supported by the python api? ``` python from adbc_driver_postgresql import dbapi with dbapi.connect(DSN) as conn: wit

[GitHub] [arrow] YoungRX opened a new issue, #34494: [C++] How to handle the limit clause when scanning Parquet files using Scanner?

2023-03-07 Thread via GitHub
YoungRX opened a new issue, #34494: URL: https://github.com/apache/arrow/issues/34494 ### Describe the usage question you have. Please include as many useful details as possible. I use `AsyncScanner::ToRecordBatchReader()` and `ScannerRecordBatchReader::ReadNext(std::shared_ptr*

[GitHub] [arrow] Fokko opened a new issue, #34495: [Python] Make `Array` iterable

2023-03-08 Thread via GitHub
Fokko opened a new issue, #34495: URL: https://github.com/apache/arrow/issues/34495 ### Describe the enhancement requested This would allow for more Pythonic patterns in downstream libraries. ``` >>> next(pa.chunked_array([[1,2,3]])) Traceback (most recent call last):

[GitHub] [arrow] mapleFU opened a new issue, #34496: [C++][Parquet] Parquet Unittest `RecordReaderStressTest` may segment fault

2023-03-08 Thread via GitHub
mapleFU opened a new issue, #34496: URL: https://github.com/apache/arrow/issues/34496 ### Describe the bug, including details regarding any error messages, version, and platform. In `RecordReaderStressTest.StressTest`. ```c++ std::default_random_engine gen(seed1); //

[GitHub] [arrow] thisisnic closed issue #34467: [R] Disable DuckDB tests on R versions < 4.0.0

2023-03-08 Thread via GitHub
thisisnic closed issue #34467: [R] Disable DuckDB tests on R versions < 4.0.0 URL: https://github.com/apache/arrow/issues/34467

[GitHub] [arrow] thisisnic opened a new issue, #34501: [C++] Substrait plan with multiple aggregate fields returns incorrect results

2023-03-08 Thread via GitHub
thisisnic opened a new issue, #34501: URL: https://github.com/apache/arrow/issues/34501 ### Describe the bug, including details regarding any error messages, version, and platform. When I specify a plan containing more than 1 aggregate, the results I get back are incorrect - the valu

[GitHub] [arrow] thisisnic closed issue #34211: [R] Segmentation fault when using write_parquet()

2023-03-08 Thread via GitHub
thisisnic closed issue #34211: [R] Segmentation fault when using write_parquet() URL: https://github.com/apache/arrow/issues/34211

[GitHub] [arrow] zeroshade closed issue #34492: PlainBooleanEncoder occasionally emits incorrect data over page boundaries

2023-03-08 Thread via GitHub
zeroshade closed issue #34492: PlainBooleanEncoder occasionally emits incorrect data over page boundaries URL: https://github.com/apache/arrow/issues/34492

[GitHub] [arrow-adbc] lidavidm closed issue #448: [CI] Nightly verification job should `git tag -f`

2023-03-08 Thread via GitHub
lidavidm closed issue #448: [CI] Nightly verification job should `git tag -f` URL: https://github.com/apache/arrow-adbc/issues/448

[GitHub] [arrow] westonpace closed issue #34257: [Docs][Dev] Update git links/branches from master to main for external projects

2023-03-08 Thread via GitHub
westonpace closed issue #34257: [Docs][Dev] Update git links/branches from master to main for external projects URL: https://github.com/apache/arrow/issues/34257
