[GitHub] [arrow] westonpace opened a new issue, #34347: [C++] Add an end-to-end fuzz test for the new scan node

2023-02-24 Thread via GitHub
westonpace opened a new issue, #34347: URL: https://github.com/apache/arrow/issues/34347 ### Describe the enhancement requested The scanner has quite a few unit tests. However, I think it would benefit from some more robust end-to-end testing as well. As further justification,

[GitHub] [arrow] westonpace closed issue #15059: [C++] The new scan node should use values from fragment guarantees instead of loading them from disk

2023-02-24 Thread via GitHub
westonpace closed issue #15059: [C++] The new scan node should use values from fragment guarantees instead of loading them from disk URL: https://github.com/apache/arrow/issues/15059 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [arrow] westonpace opened a new issue, #34346: [C++] Allow the CSV reader to read zero columns

2023-02-24 Thread via GitHub
westonpace opened a new issue, #34346: URL: https://github.com/apache/arrow/issues/34346 ### Describe the enhancement requested There are times we may want to scan a CSV file and just count the lines. One example is when we want to count the # of rows in a dataset. Another ex

[GitHub] [arrow] westonpace opened a new issue, #34344: [C++] Pass function registry to dataset operations

2023-02-24 Thread via GitHub
westonpace opened a new issue, #34344: URL: https://github.com/apache/arrow/issues/34344 ### Describe the enhancement requested The datasets API does various compute operations on expressions. All of these operations today use the default function registry. This could prevent us fr

[GitHub] [arrow-adbc] adamkennedy opened a new issue, #477: [Java] driver/jdbc: No way to provide an external BufferAllocator to JdbcDriver or JdbcDatabase instances

2023-02-24 Thread via GitHub
adamkennedy opened a new issue, #477: URL: https://github.com/apache/arrow-adbc/issues/477 JdbcDriver is an enum singleton that assigns its own RootAllocator that is never subsequently destroyed. This is a problem both because there is no ability to control or share memory pools bet

[GitHub] [arrow] kou closed issue #15209: [C++][Gandiva] Add abs function

2023-02-24 Thread via GitHub
kou closed issue #15209: [C++][Gandiva] Add abs function URL: https://github.com/apache/arrow/issues/15209

[GitHub] [arrow] kou closed issue #34329: [C++] Memory access out of bounds in arrow-1.0.0

2023-02-24 Thread via GitHub
kou closed issue #34329: [C++] Memory access out of bounds in arrow-1.0.0 URL: https://github.com/apache/arrow/issues/34329

[GitHub] [arrow] abcbarryn opened a new issue, #34341: Arrow version 5 or later fails to compile/link with gcc version 7 (or earlier)

2023-02-24 Thread via GitHub
abcbarryn opened a new issue, #34341: URL: https://github.com/apache/arrow/issues/34341 ### Describe the bug, including details regarding any error messages, version, and platform. When linking the shared library with gcc 7 I get several errors like this: ``_ZN5arrow6ResultISt

[GitHub] [arrow] eitsupi opened a new issue, #34339: [R] Add `skip_rows_after_names` option to `read_csv`'s options

2023-02-24 Thread via GitHub
eitsupi opened a new issue, #34339: URL: https://github.com/apache/arrow/issues/34339 ### Describe the bug, including details regarding any error messages, version, and platform. Add an option to skip rows after the column names. This was implemented in C++ and Python (#28410, #102

[GitHub] [arrow] adamkennedy opened a new issue, #34338: [Java] BaseAllocator.DEBUG should be opt-in as HistoricalLog is immensely expensive and breaks profiling

2023-02-24 Thread via GitHub
adamkennedy opened a new issue, #34338: URL: https://github.com/apache/arrow/issues/34338 ### Describe the bug, including details regarding any error messages, version, and platform. BaseAllocator.DEBUG is currently enabled automatically any time assertions are enabled via -ea which

[GitHub] [arrow] mapleFU opened a new issue, #34335: [C++][Parquet] Improve the performance for Decoding DELTA_LENGTH_BYTE_ARRAY

2023-02-24 Thread via GitHub
mapleFU opened a new issue, #34335: URL: https://github.com/apache/arrow/issues/34335 ### Describe the enhancement requested To be honest, the previous logic is too slow. It's 20 times slower than PLAIN for ByteArray. I'll do some optimizations on it. ### Component(s) C++

[GitHub] [arrow] yevgenypats opened a new issue, #34334: [GO] CSV not handling all files

2023-02-24 Thread via GitHub
yevgenypats opened a new issue, #34334: URL: https://github.com/apache/arrow/issues/34334 ### Describe the enhancement requested It seems that CSV is missing some types at the moment (including BinaryType and BinaryFixedLengthType) - https://github.com/apache/arrow/blob/main/go/arrow/csv/c

[GitHub] [arrow-adbc] lidavidm opened a new issue, #476: [Java] driver/jdbc: Connection does not close its allocator

2023-02-24 Thread via GitHub
lidavidm opened a new issue, #476: URL: https://github.com/apache/arrow-adbc/issues/476 https://github.com/apache/arrow-adbc/blob/e79edafdb02339664ea735097f8c1edc0ea052de/java/driver/jdbc/src/main/java/org/apache/arrow/adbc/driver/jdbc/JdbcConnection.java#L254-L257

[GitHub] [arrow] icexelloss opened a new issue, #34333: [Python] Test run_query with a registered UDF

2023-02-24 Thread via GitHub
icexelloss opened a new issue, #34333: URL: https://github.com/apache/arrow/issues/34333 ### Describe the enhancement requested In https://github.com/apache/arrow/pull/34288 we added the capability for the Substrait consumer to decode a registered UDF by name. Let's test if this integr

[GitHub] [arrow] westonpace closed issue #31910: [C++] Add support for Substrait cast expression

2023-02-24 Thread via GitHub
westonpace closed issue #31910: [C++] Add support for Substrait cast expression URL: https://github.com/apache/arrow/issues/31910

[GitHub] [arrow] srebhan opened a new issue, #34332: Add driver for `database/sql` framework

2023-02-24 Thread via GitHub
srebhan opened a new issue, #34332: URL: https://github.com/apache/arrow/issues/34332 ### Describe the enhancement requested Due to its wide adoption and ease of use, a driver for Golang's `database/sql` framework will help new users and applications using different data sources.

[GitHub] [arrow] yevgenypats opened a new issue, #34330: [GO] Parquet file handling extensions

2023-02-24 Thread via GitHub
yevgenypats opened a new issue, #34330: URL: https://github.com/apache/arrow/issues/34330 ### Describe the enhancement requested Hi there! I was trying to use the parquet writer and saw it doesn't support extensions. I'm curious whether this will be supported or whether I need to implement some ki

[GitHub] [arrow] luffy-zh opened a new issue, #34329: [c++] Memory access out of bounds in arrow-1.0.0

2023-02-24 Thread via GitHub
luffy-zh opened a new issue, #34329: URL: https://github.com/apache/arrow/issues/34329 ### Describe the bug, including details regarding any error messages, version, and platform. Memory access is out of bounds at line array_nested.cc:78. There is no need to clear the bits of the fin

[GitHub] [arrow] wgtmac opened a new issue, #34328: [CI] AMD64 Windows MinGW 64 GLib & Ruby is failing

2023-02-24 Thread via GitHub
wgtmac opened a new issue, #34328: URL: https://github.com/apache/arrow/issues/34328 ### Describe the bug, including details regarding any error messages, version, and platform. https://github.com/apache/arrow/actions/runs/4261050340/jobs/7414917058 ``` Omission: parquet::Ro