[GitHub] [arrow] felipecrv opened a new issue, #34176: Finish basic Run-End Encoded arrays support in C++

2023-02-13 Thread via GitHub
felipecrv opened a new issue, #34176: URL: https://github.com/apache/arrow/issues/34176
### Describe the enhancement requested
C++ related issues that are sub-tasks of #32104 that haven't been fixed by #33641.
- [ ] #32105
- [ ] #32107
- [ ] #20351
- [ ] #32773

[GitHub] [arrow] ianmcook closed issue #34166: [R] int64 not preserved when calling dplyr::collect

2023-02-13 Thread via GitHub
ianmcook closed issue #34166: [R] int64 not preserved when calling dplyr::collect URL: https://github.com/apache/arrow/issues/34166 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comme

[GitHub] [arrow] wjones127 closed issue #15231: [Benchmarking][C++] Track memory usage in C++ microbenchmarks

2023-02-13 Thread via GitHub
wjones127 closed issue #15231: [Benchmarking][C++] Track memory usage in C++ microbenchmarks URL: https://github.com/apache/arrow/issues/15231

[GitHub] [arrow] lidavidm opened a new issue, #34175: [Docs] .github/CONTRIBUTING.md still references Jira

2023-02-13 Thread via GitHub
lidavidm opened a new issue, #34175: URL: https://github.com/apache/arrow/issues/34175 ### Describe the bug, including details regarding any error messages, version, and platform. This should be updated to reflect that we now use GitHub Issues. This one is important since it ap

[GitHub] [arrow] james-camacho-ab closed issue #12892: [R] Arrow install on Databricks cluster takes 10+ minutes

2023-02-13 Thread via GitHub
james-camacho-ab closed issue #12892: [R] Arrow install on Databricks cluster takes 10+ minutes URL: https://github.com/apache/arrow/issues/12892

[GitHub] [arrow] lidavidm opened a new issue, #34174: [Docs][Release] Add 'tweet out the blog post' as a post-release task

2023-02-13 Thread via GitHub
lidavidm opened a new issue, #34174: URL: https://github.com/apache/arrow/issues/34174 ### Describe the enhancement requested We tend to forget to do this. While Twitter has been shaky recently, it's still used quite a bit, and it would be good to promote new releases. What do people

[GitHub] [arrow-adbc] lidavidm opened a new issue, #454: [Python] Add __del__ to DBAPI objects

2023-02-13 Thread via GitHub
lidavidm opened a new issue, #454: URL: https://github.com/apache/arrow-adbc/issues/454 Just for convenience/ease of use. Optionally we can have this emit a warning to aid debuggability?

[GitHub] [arrow] mroeschke opened a new issue, #34173: [Python] Allow pyarrow.compute.mode to include null count

2023-02-13 Thread via GitHub
mroeschke opened a new issue, #34173: URL: https://github.com/apache/arrow/issues/34173 ### Describe the enhancement requested There is a `skip_nulls` argument to dictate whether nulls should make the result null or be skipped, but it would potentially be useful for `mode` to retu

[GitHub] [arrow] zeroshade opened a new issue, #34171: [Go][Compute] Add kernel for "unique" function

2023-02-13 Thread via GitHub
zeroshade opened a new issue, #34171: URL: https://github.com/apache/arrow/issues/34171 ### Describe the enhancement requested Following up on #33466, in order to implement direct and efficient handling of dictionary arrays to/from parquet without having to expand them out, we first

[GitHub] [arrow] egillax opened a new issue, #34166: [R] int64 not preserved when calling dplyr::collect

2023-02-13 Thread via GitHub
egillax opened a new issue, #34166: URL: https://github.com/apache/arrow/issues/34166 ### Describe the bug, including details regarding any error messages, version, and platform. When collecting arrow tables with 64 bit integer columns the column is converted to 32 bit integer. In th

[GitHub] [arrow] AlenkaF opened a new issue, #34165: [Python] Extension array data type should default to the storage type if to_pandas_dtype is not implemented

2023-02-13 Thread via GitHub
AlenkaF opened a new issue, #34165: URL: https://github.com/apache/arrow/issues/34165 ### Describe the bug, including details regarding any error messages, version, and platform. When working on the extension type for tensors in PyArrow I came across a behaviour of the conversion to

[GitHub] [arrow] zeroshade closed issue #34077: [Go] Implement RunEndEncoded scalar

2023-02-13 Thread via GitHub
zeroshade closed issue #34077: [Go] Implement RunEndEncoded scalar URL: https://github.com/apache/arrow/issues/34077

[GitHub] [arrow] NoahFournier opened a new issue, #34163: [C++][CI] Typo in build_orc CMake macro

2023-02-13 Thread via GitHub
NoahFournier opened a new issue, #34163: URL: https://github.com/apache/arrow/issues/34163 ### Describe the bug, including details regarding any error messages, version, and platform. I've found a typo in the build_orc macro in the ThirdPartyToolchain, which means that the orc build

[GitHub] [arrow] zeroshade closed issue #34101: [Go] pqarrow.NewSchemaManifest creates wrong schema field for array object fields

2023-02-13 Thread via GitHub
zeroshade closed issue #34101: [Go] pqarrow.NewSchemaManifest creates wrong schema field for array object fields URL: https://github.com/apache/arrow/issues/34101

[GitHub] [arrow] Fokko opened a new issue, #34162: [Python] `is_null(nan_is_null=True)` does not work with only NaN's

2023-02-13 Thread via GitHub
Fokko opened a new issue, #34162: URL: https://github.com/apache/arrow/issues/34162 ### Describe the bug, including details regarding any error messages, version, and platform. I was working on some test-cases for the PyIceberg integration, and hit this edge case. When you have a fil

[GitHub] [arrow] nealrichardson closed issue #33892: [R] Map `dplyr::n()` to `count_all` kernel

2023-02-13 Thread via GitHub
nealrichardson closed issue #33892: [R] Map `dplyr::n()` to `count_all` kernel URL: https://github.com/apache/arrow/issues/33892

[GitHub] [arrow] nealrichardson closed issue #33960: [R] Output schema for aggregation is sometimes inaccurate

2023-02-13 Thread via GitHub
nealrichardson closed issue #33960: [R] Output schema for aggregation is sometimes inaccurate URL: https://github.com/apache/arrow/issues/33960

[GitHub] [arrow] AlenkaF opened a new issue, #34160: [Docs][Release] Multiple copies/versions of versionwarning.js

2023-02-13 Thread via GitHub
AlenkaF opened a new issue, #34160: URL: https://github.com/apache/arrow/issues/34160 ### Describe the bug, including details regarding any error messages, version, and platform. There are currently three versions of the `versionwarning.js` file in apache/arrow-site: - docs/_s

[GitHub] [arrow] js8544 opened a new issue, #34157: [C++] Configure bundled AWS SDK to use aws-lc instead of OpenSSL

2023-02-13 Thread via GitHub
js8544 opened a new issue, #34157: URL: https://github.com/apache/arrow/issues/34157 ### Describe the enhancement requested Using OpenSSL causes various issues like https://github.com/apache/arrow/pull/33808#issuecomment-1408247269 and https://github.com/apache/arrow/issues/34111. We