Re: [PR] [parquet] Add row group index virtual column [arrow-rs]

2026-02-09 Thread via GitHub
vustef commented on PR #9117: URL: https://github.com/apache/arrow-rs/pull/9117#issuecomment-3871071014 > I think the extension type marker was pioneered by @jkylling and @vustef > > I will also try and review this sooner rather than later Sorry for not taking a look at this, I

Re: [PR] GH-48962: [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
guillaume-rochette-oxb commented on PR #48963: URL: https://github.com/apache/arrow/pull/48963#issuecomment-3871096056 Hi @raulcd, No, none of this was "AI" generated, I have put the effort and typed it all by myself. However, I understand that it does not follow the structure,

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3871251482 @github-actions crossbow submit -g verify-rc-source -g packaging -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

Re: [PR] GH-49159: [C++][Gandiva] Detect overflow in repeat() [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #49160: URL: https://github.com/apache/arrow/pull/49160#issuecomment-3871248902 @pitrou what do you think about this? I plan to merge this and include it as part of 23.0.1 if there's no issues with it. This is the only thing missing so far from me creating RC0.

Re: [PR] [C++][Parquet] GH-47628: Implement basic parquet file rewriter [arrow]

2026-02-09 Thread via GitHub
HuaHuaY commented on code in PR #47775: URL: https://github.com/apache/arrow/pull/47775#discussion_r2782236557 ## cpp/src/parquet/file_rewriter.cc: ## @@ -0,0 +1,455 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agreements. Se

Re: [I] [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
guillaume-rochette-oxb commented on issue #48962: URL: https://github.com/apache/arrow/issues/48962#issuecomment-3871294989 Hey @rok, Yes, at runtime, we are reading them with `pyarrow.dataset.dataset().to_batches()`, however, we do not have the possibility to control the `row_group_size

[PR] feat: support RunEndEncoded arrays in arrow-json reader and writer [arrow-rs]

2026-02-09 Thread via GitHub
Abhisheklearn12 opened a new pull request, #9379: URL: https://github.com/apache/arrow-rs/pull/9379 # Which issue does this PR close? - Closes #9359. # Rationale for this change The `arrow-json` crate does not support `RunEndEncoded` arrays. This adds read and write supp

Re: [PR] [C++][Parquet] GH-47628: Implement basic parquet file rewriter [arrow]

2026-02-09 Thread via GitHub
HuaHuaY commented on code in PR #47775: URL: https://github.com/apache/arrow/pull/47775#discussion_r2782244662 ## cpp/src/parquet/file_rewriter.cc: ## @@ -0,0 +1,455 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agreements. Se

Re: [PR] GH-48962: [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
guillaume-rochette-oxb commented on PR #48963: URL: https://github.com/apache/arrow/pull/48963#issuecomment-3871317315 Thanks, I'll relocate the code in `table.pxi`, do the same for the tests, and format the docstrings accordingly. In your C++ snippet, are the `batches` referring to a

Re: [PR] GH-1014: [Docs] Fix broken and obsolete links in the README.md [arrow-java]

2026-02-09 Thread via GitHub
axreldable commented on PR #1015: URL: https://github.com/apache/arrow-java/pull/1015#issuecomment-3871360534 Please add label: `documentation`

Re: [PR] GH-1014: [Docs] Fix broken and obsolete links in the README.md [arrow-java]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #1015: URL: https://github.com/apache/arrow-java/pull/1015#issuecomment-3871358838 Thank you for opening a pull request! Please label the PR with one or more of: - bug-fix - chore - dependencies - documentation - enhancemen

Re: [I] [Doc] Make main column width larger [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29314: URL: https://github.com/apache/arrow/issues/29314#issuecomment-3871167122 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [C++] Improve performance of unpack64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29313: URL: https://github.com/apache/arrow/issues/29313#issuecomment-3871167026 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [Python] Use IPC writing code for pickling RecordBatches [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29325: URL: https://github.com/apache/arrow/issues/29325#issuecomment-3871167243 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [C++] Add an arrow::Table::GetFieldByName method [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29279: URL: https://github.com/apache/arrow/issues/29279#issuecomment-3871166624 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [C++][Compute] Add Find method to Grouper [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29341: URL: https://github.com/apache/arrow/issues/29341#issuecomment-3871167375 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [C++][Compute] Provide a default implementation of ExecNode::Pause/Resume [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29296: URL: https://github.com/apache/arrow/issues/29296#issuecomment-3871166881 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [C++][Dataset] Remove UnionDataset in favor of UnionExecNode [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on issue #29291: URL: https://github.com/apache/arrow/issues/29291#issuecomment-3871166755 This issue has been marked as stale because it has had no activity in the past 365 days. Please remove the stale label or comment below, or this issue will be closed in 1

Re: [I] [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
rok commented on issue #48962: URL: https://github.com/apache/arrow/issues/48962#issuecomment-3871222436 Hey @guillaume-rochette-oxb, sorry I'm late to the party. > I would like to add a functionality enabling to dynamically restack/resize a stream of pa.RecordBatch w.r.t. to minimums

Re: [PR] GH-48962: [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #48963: URL: https://github.com/apache/arrow/pull/48963#issuecomment-3871235959 Thanks for your comment and thanks for sharing the details about AI usage. The functionality you are proposing can potentially make sense, even though it could require non-zero copy

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3871268112 Revision: 985621dbfcf3fd2061889e43c50b59825df84f3f Submitted crossbow builds: [ursacomputing/crossbow @ actions-60c4e00895](https://github.com/ursacomputing/crossbow/bra

[PR] GH-1014: [Docs] Fix broken and obsolete links in the README.md [arrow-java]

2026-02-09 Thread via GitHub
axreldable opened a new pull request, #1015: URL: https://github.com/apache/arrow-java/pull/1015 ## What's Changed Fix broken links in `README.md` file. Remove the unused reference [2]: https://github.com/apache/arrow/blob/main/cpp/README.md. Inline footnote links. Close

Re: [I] [Doc] Cannot switch to doc for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
shashbha14 commented on issue #49187: URL: https://github.com/apache/arrow/issues/49187#issuecomment-3871395111 take

Re: [PR] GH-48962: [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
rok commented on PR #48963: URL: https://github.com/apache/arrow/pull/48963#issuecomment-3871509729 Here's a quick LLM-made, throwaway sketch: https://github.com/rok/arrow/pull/48 It's meant to show where you could place logic and test it. Given your usecase I'm not sure a C++ implemen

Re: [PR] [Variant] Support `['fieldName']` in VariantPath parser [arrow-rs]

2026-02-09 Thread via GitHub
klion26 commented on code in PR #9276: URL: https://github.com/apache/arrow-rs/pull/9276#discussion_r2782241113 ## parquet-variant/src/path.rs: ## @@ -103,6 +113,12 @@ impl<'a> VariantPath<'a> { pub fn is_empty(&self) -> bool { self.0.is_empty() } + +/// P

Re: [PR] [C++][Parquet] GH-47628: Implement basic parquet file rewriter [arrow]

2026-02-09 Thread via GitHub
Copilot commented on code in PR #47775: URL: https://github.com/apache/arrow/pull/47775#discussion_r2782481214 ## cpp/src/parquet/bloom_filter_writer.h: ## @@ -92,6 +92,18 @@ class PARQUET_EXPORT BloomFilterBuilder { /// - `WriteTo()` has been called virtual BloomFilter*

[PR] Fix docs version switcher URL for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
shashbha14 opened a new pull request, #49188: URL: https://github.com/apache/arrow/pull/49188 When selecting "1.0" in the docs version dropdown, it redirected back to the dev docs. The entry for 1.0 in `docs/source/_static/versions.json` had `url: https://arrow.apache.org/

Re: [PR] Fix docs version switcher URL for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49188: URL: https://github.com/apache/arrow/pull/49188#issuecomment-3871586893 Thanks for opening a pull request! If this is not a [minor PR](https://github.com/apache/arrow/blob/main/CONTRIBUTING.md#Minor-Fixes). Could you open an issue f

Re: [PR] feat: support RunEndEncoded arrays in arrow-json reader and writer [arrow-rs]

2026-02-09 Thread via GitHub
Abhisheklearn12 commented on PR #9379: URL: https://github.com/apache/arrow-rs/pull/9379#issuecomment-3871627338 Hi @Jefffrey, I’d love to get your feedback whenever you have time. Appreciate it!

Re: [PR] Fix docs version switcher URL for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
rok commented on PR #49188: URL: https://github.com/apache/arrow/pull/49188#issuecomment-3871636346 This PR appears to be introducing unrelated C++ changes.

Re: [I] [C++] CRAN build fail on missing `std::floating_point` concept [arrow]

2026-02-09 Thread via GitHub
jonkeane commented on issue #49176: URL: https://github.com/apache/arrow/issues/49176#issuecomment-3871728137 > Or is it possible to change the deployment target to something newer on the CRAN build? As far as I'm aware, no. For example, https://cran.r-project.org/doc/manuals/r-relea

Re: [PR] MINOR: [R][C++] use macos14 to see if that replicate issues on CRAN [arrow]

2026-02-09 Thread via GitHub
jonkeane commented on PR #49178: URL: https://github.com/apache/arrow/pull/49178#issuecomment-3871742382 There's more discussion on #49176, but this hasn't worked and likely will not: macos 13 runners aren't readily available anymore, and using older clang on ubuntu does not replicate the i

Re: [PR] MINOR: [R][C++] use macos14 to see if that replicate issues on CRAN [arrow]

2026-02-09 Thread via GitHub
jonkeane closed pull request #49178: MINOR: [R][C++] use macos14 to see if that replicate issues on CRAN URL: https://github.com/apache/arrow/pull/49178

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870165678 @github-actions crossbow submit test-debian-12-docs

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870174265 ``` Unable to match any tasks for `test-debian-12-docs` The Archery job run can be found at: https://github.com/apache/arrow/actions/runs/21818077593 ```

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870221818 ``` Only contributors can submit requests to this bot. Please ask someone from the community for help with getting the first commit in. The Archery job run can be found a

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
MugundanMCW commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870218970 @github-actions crossbow submit wheel-windows-*arm64

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870222771 ``` Only contributors can submit requests to this bot. Please ask someone from the community for help with getting the first commit in. The Archery job run can be found a

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-387097 ``` Only contributors can submit requests to this bot. Please ask someone from the community for help with getting the first commit in. The Archery job run can be found a

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
MugundanMCW commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870237802 Hi @raulcd I have added support for Windows ARM64 thread builds in the CI, could you trigger the PyArrow builds?

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
kou commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870244666 @github-actions crossbow submit wheel-windows-cp314-*

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870248944 @github-actions crossbow submit test-debian-13-docs

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870261188 Revision: 1bea06ad4e14d75dd97a78a0148cd9cf6f4df0bc Submitted crossbow builds: [ursacomputing/crossbow @ actions-8c965f9fff](https://github.com/ursacomputing/crossbow/bra

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870258233 Revision: 41418e0df78d9449ee3e9354f7ce3148df02419c Submitted crossbow builds: [ursacomputing/crossbow @ actions-cc9438ddaa](https://github.com/ursacomputing/crossbow/bra

Re: [PR] [C++] Add optional native _Float16 fast-path for Float16 conversions. [arrow]

2026-02-09 Thread via GitHub
Arbaaz123676 closed pull request #49180: [C++] Add optional native _Float16 fast-path for Float16 conversions. URL: https://github.com/apache/arrow/pull/49180

Re: [I] [Python] Remove --disable-warnings with newer version of pytest-cython [arrow]

2026-02-09 Thread via GitHub
AlenkaF commented on issue #33715: URL: https://github.com/apache/arrow/issues/33715#issuecomment-3870617462 Upstream issue has been resolved and a [new release](https://github.com/pytest-cython/pytest-cython/releases/tag/v0.4.0rc0) is out so I will open a PR that removes `--disable-warning

Re: [I] [Doc] tzdata error due to a lack of a discoverable system timezone database [arrow]

2026-02-09 Thread via GitHub
h-vetinari commented on issue #49172: URL: https://github.com/apache/arrow/issues/49172#issuecomment-3870843921 Well the main thing AFAICT is that #48601 hasn't been merged yet. I backported this not so much for fun, but because there was no realistic way to unbreak CI after a latent build

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870803871 @github-actions crossbow submit test-debian-13-docs

Re: [PR] WIP: Dummy PR to check maint-23.0.1 status [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49130: URL: https://github.com/apache/arrow/pull/49130#issuecomment-3870813475 Revision: 985621dbfcf3fd2061889e43c50b59825df84f3f Submitted crossbow builds: [ursacomputing/crossbow @ actions-7b7dc30106](https://github.com/ursacomputing/crossbow/bra

Re: [PR] fix: resolution of complex type variants in Avro unions [arrow-rs]

2026-02-09 Thread via GitHub
mzabaluev commented on code in PR #9328: URL: https://github.com/apache/arrow-rs/pull/9328#discussion_r2781821762 ## arrow-avro/src/reader/record.rs: ## @@ -1054,10 +1082,45 @@ impl Decoder { } } +fn decode_with_resolution<'d>( +&'d mut self, +

Re: [I] [Doc] Cannot switch to doc for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
pitrou commented on issue #49187: URL: https://github.com/apache/arrow/issues/49187#issuecomment-3870886760 @AlenkaF

Re: [I] [Doc] Cannot switch to doc for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
pitrou commented on issue #49187: URL: https://github.com/apache/arrow/issues/49187#issuecomment-3870891568 I'll note that I can still visit them by changing the URL manually: https://arrow.apache.org/docs/1.0/

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
MugundanMCW commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870907905 @kou the failure appears to be caused by a missing License file in the generated wheel package. Is that a known issue? https://github.com/user-attachments/assets/52369f10-ba26-42c2-

Re: [I] [C++] C++20 modernization [arrow]

2026-02-09 Thread via GitHub
HuaHuaY commented on issue #48587: URL: https://github.com/apache/arrow/issues/48587#issuecomment-3870744730 It seems that some ci environments not support `#include ` or `std::views::join`. For examples: https://github.com/apache/arrow/actions/runs/21817350381/job/62941834858 https

Re: [I] [Doc] tzdata error due to a lack of a discoverable system timezone database [arrow]

2026-02-09 Thread via GitHub
shr3yas-k commented on issue #49172: URL: https://github.com/apache/arrow/issues/49172#issuecomment-3870759419 I used Miniforge3 to setup conda-forge and I cloned the apache arrow repo locally and everything after that was according to the standard python-development workflow for pyarrow.

Re: [I] [Doc] tzdata error due to a lack of a discoverable system timezone database [arrow]

2026-02-09 Thread via GitHub
rok commented on issue #49172: URL: https://github.com/apache/arrow/issues/49172#issuecomment-3870784876 Do you currently still see this issue? @h-vetinari has recently backported a fix for windows on conda-forge: https://github.com/apache/arrow/pull/48601#issuecomment-3794269486 and can p

Re: [I] [CI] Update default platform versions in `.env` [arrow]

2026-02-09 Thread via GitHub
raulcd commented on issue #49024: URL: https://github.com/apache/arrow/issues/49024#issuecomment-3870796298 cherry-picking this commit hasn't solved the problem and it still fails on the maintenance branch. It seems the required issue is the following one: - https://github.com/apache/arro

Re: [I] [R][C++] Bump C++20 in R build infrastructure [arrow]

2026-02-09 Thread via GitHub
raulcd commented on issue #48817: URL: https://github.com/apache/arrow/issues/48817#issuecomment-3870796837 This seems to be required for 23.0.1 in order to build the docs. Otherwise this fails: ``` *** Trying Arrow C++ found by pkg-config: /tmp/local C++ library version 23.0.

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870977218 > the failure appears to be caused by a missing License file in the generated wheel package. Is that a known issue? No, this is working on main. You should use: ``` %PYTHON_CM

Re: [I] [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
guillaume-rochette-oxb commented on issue #48962: URL: https://github.com/apache/arrow/issues/48962#issuecomment-3870980307 Hi @AlenkaF, @raulcd, @rok, Could you please spare a moment to provide an answer about this potential enhancement request, and its associated [PR](https://github.co

Re: [PR] fix: resolution of complex type variants in Avro unions [arrow-rs]

2026-02-09 Thread via GitHub
mzabaluev-flarion commented on code in PR #9328: URL: https://github.com/apache/arrow-rs/pull/9328#discussion_r2781933353 ## arrow-avro/src/codec.rs: ## @@ -1533,62 +1550,35 @@ impl<'a> Maker<'a> { nullable_union_variants(reader_variants) {

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
MugundanMCW commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3870999068 @raulcd Thanks, I have made the corresponding changes. Could you trigger the Wheel builder again?

Re: [I] [Python] Add `restack_batches()` [arrow]

2026-02-09 Thread via GitHub
raulcd commented on issue #48962: URL: https://github.com/apache/arrow/issues/48962#issuecomment-3871032377 I've answered on the PR.

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
raulcd commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3871039830 @github-actions crossbow submit wheel-windows-cp314-*

Re: [I] [Doc] tzdata error due to a lack of a discoverable system timezone database [arrow]

2026-02-09 Thread via GitHub
rok commented on issue #49172: URL: https://github.com/apache/arrow/issues/49172#issuecomment-3871044267 @shr3yas-k given the current state it then makes sense to document this. Do please check where we currently document [tz data related things](https://arrow.apache.org/docs/search.html?q=

Re: [I] [CI][Python][C++] Support on Power Architecture [arrow]

2026-02-09 Thread via GitHub
sandeepgupta12 commented on issue #43817: URL: https://github.com/apache/arrow/issues/43817#issuecomment-3871041973 Hi @assignUser, I wanted to share an update from our side. Based on community input, we’ve improved how authentication tokens are generated for Organizations and have

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3871051791 Revision: 4b4855ff9dfa8c8c66f048bb0c3ae922c0dbbb68 Submitted crossbow builds: [ursacomputing/crossbow @ actions-bb088c59f9](https://github.com/ursacomputing/crossbow/bra

Re: [I] [C++] Enable hardware support for arrow::util::Float16 on GCC and Clang [arrow]

2026-02-09 Thread via GitHub
andishgar commented on issue #49179: URL: https://github.com/apache/arrow/issues/49179#issuecomment-3870118610 take

Re: [I] [C++][Azure] Add ApplicationId to AzureFileSystem [arrow]

2026-02-09 Thread via GitHub
raulcd commented on issue #49169: URL: https://github.com/apache/arrow/issues/49169#issuecomment-3870105079 Is `azpartner-` a recommendation from the Azure team for third party User Agent identifier?

Re: [I] [C++] Enable hardware support for arrow::util::Float16 on GCC and Clang [arrow]

2026-02-09 Thread via GitHub
andishgar commented on issue #49179: URL: https://github.com/apache/arrow/issues/49179#issuecomment-3870138773 @kou, this PR (#49180) is not related to my work. If possible, I would appreciate it if you could close it.

Re: [PR] fix: fix [[NULL]] array doesn't roundtrip in arrow-row bug [arrow-rs]

2026-02-09 Thread via GitHub
lichuang commented on PR #9275: URL: https://github.com/apache/arrow-rs/pull/9275#issuecomment-3872016362 @alamb

Re: [PR] [Parquet] Add SIMD-accelerated byte-stream-split decoding [arrow-go]

2026-02-09 Thread via GitHub
daniel-adam-tfs commented on PR #654: URL: https://github.com/apache/arrow-go/pull/654#issuecomment-3872060268 > @daniel-adam-tfs any further work needed here to make it ready for review? Yeah, let me add the equivalent version for float64 and check how complex would it be do add SIMD

[I] Parallel Parquet Reading [arrow-rs]

2026-02-09 Thread via GitHub
pmarks opened a new issue, #9381: URL: https://github.com/apache/arrow-rs/issues/9381 **Is your feature request related to a problem or challenge? Please describe what you are trying to do.** I want to make many parallel data fetch requests to the underlying object store when fetching da

Re: [PR] GH-49159: [C++][Gandiva] Detect overflow in repeat() [arrow]

2026-02-09 Thread via GitHub
pitrou commented on code in PR #49160: URL: https://github.com/apache/arrow/pull/49160#discussion_r2783019670 ## cpp/src/gandiva/precompiled/string_ops.cc: ## @@ -841,7 +841,12 @@ const char* repeat_utf8_int32(gdv_int64 context, const char* in, gdv_int32 in_le *out_len = 0
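For readers skimming this thread, here is a minimal sketch of the kind of check the PR title describes: detecting overflow when a repeated string's output length is computed from 32-bit inputs. This is not the actual Gandiva patch. The helper name `checked_repeat_len` and its signature are hypothetical, chosen only for illustration, and the approach shown (widening to 64 bits before comparing against the `int32_t` maximum) is one common technique that may differ from what the PR does.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper, NOT Gandiva's code: computes in_len * repeat_number
// as a 32-bit length and reports failure instead of wrapping on overflow.
bool checked_repeat_len(int32_t in_len, int32_t repeat_number, int32_t* out_len) {
  *out_len = 0;  // defined value on every failure path
  if (in_len < 0 || repeat_number < 0) {
    return false;  // negative lengths or counts are rejected in this sketch
  }
  // Widen both operands so the multiplication itself cannot overflow.
  const int64_t total =
      static_cast<int64_t>(in_len) * static_cast<int64_t>(repeat_number);
  if (total > std::numeric_limits<int32_t>::max()) {
    return false;  // result would not fit in a 32-bit output length
  }
  *out_len = static_cast<int32_t>(total);
  return true;
}
```

A caller would check the return value before allocating the output buffer and report an error when it is `false`, rather than proceeding with a wrapped length.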

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783049368 ## cpp/src/arrow/util/bpacking_dispatch_internal.h: ## @@ -188,297 +202,431 @@ void unpack_width(const uint8_t* in, UnpackedUInt* out, int batch_size, int bit_

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783043956 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783058104 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] Azure ADLS list_with_offset support [arrow-rs-object-store]

2026-02-09 Thread via GitHub
crepererum commented on PR #623: URL: https://github.com/apache/arrow-rs-object-store/pull/623#issuecomment-3872200141 The Azure emulator test is failing in CI. Rest looks good though. Big thanks for adding another generic test and adding it to all other implementations :+1:

Re: [I] [Doc] Cannot switch to doc for Arrow 1.0 [arrow]

2026-02-09 Thread via GitHub
AlenkaF commented on issue #49187: URL: https://github.com/apache/arrow/issues/49187#issuecomment-3872203767 This is strange, the link is wrong here: https://github.com/apache/arrow-site/blob/834cb4c24abfc70b069aa0d4d34b354e27c6ceaa/docs/_static/versions.json#L131 so this is the only

[I] Arrow-csv does not update null flag in `Field` on inference [arrow-rs]

2026-02-09 Thread via GitHub
realonbebeto opened a new issue, #9380: URL: https://github.com/apache/arrow-rs/issues/9380 **Describe the bug** **To Reproduce** **Expected behaviour** **Additional context**

Re: [PR] Add CRC64NVME checksum support [arrow-rs-object-store]

2026-02-09 Thread via GitHub
crepererum commented on PR #633: URL: https://github.com/apache/arrow-rs-object-store/pull/633#issuecomment-3871859606 Well, it was a suggestion, but it's not a hill I'm gonna die on though :wink:

Re: [I] [C++] CRAN build fail on missing `std::floating_point` concept [arrow]

2026-02-09 Thread via GitHub
pitrou commented on issue #49176: URL: https://github.com/apache/arrow/issues/49176#issuecomment-3872123115 Perhaps you can enable https://cmake.org/cmake/help/latest/variable/CMAKE_VERBOSE_MAKEFILE.html on the CRAN build to see which command-line flags precisely are passed to the compiler?

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783116838 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783129267 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783144434 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783168609 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

[PR] Update sysinfo requirement from 0.37.1 to 0.38.1 [arrow-rs]

2026-02-09 Thread via GitHub
dependabot[bot] opened a new pull request, #9383: URL: https://github.com/apache/arrow-rs/pull/9383 Updates the requirements on [sysinfo](https://github.com/GuillaumeGomez/sysinfo) to permit the latest version. Changelog Sourced from https://github.com/GuillaumeGomez/sysinfo/blob/

Re: [PR] chore(deps): update sysinfo requirement from 0.37.1 to 0.38.0 [arrow-rs]

2026-02-09 Thread via GitHub
dependabot[bot] commented on PR #9265: URL: https://github.com/apache/arrow-rs/pull/9265#issuecomment-3872419734 Superseded by #9383. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] chore(deps): update sysinfo requirement from 0.37.1 to 0.38.0 [arrow-rs]

2026-02-09 Thread via GitHub
dependabot[bot] closed pull request #9265: chore(deps): update sysinfo requirement from 0.37.1 to 0.38.0 URL: https://github.com/apache/arrow-rs/pull/9265 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

[PR] GH-49184: [CI] AMD64 macOS 15-intel Python 3 consistently times out [arrow]

2026-02-09 Thread via GitHub
tadeja opened a new pull request, #49189: URL: https://github.com/apache/arrow/pull/49189 ### Rationale for this change Recent CI checks failing with the job `AMD64 macOS 15-intel Python 3` being cancelled at 60 minutes. ```The job has exceeded the maximum execution time of 1h0m0s```

Re: [PR] GH-49184: [CI] AMD64 macOS 15-intel Python 3 consistently times out [arrow]

2026-02-09 Thread via GitHub
github-actions[bot] commented on PR #49189: URL: https://github.com/apache/arrow/pull/49189#issuecomment-3872429839 :warning: GitHub issue #49184 **has been automatically assigned in GitHub** to PR creator. -- This is an automated message from the Apache Git Service. To respond to the mes

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783173930 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783188395 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783197094 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

Re: [PR] GH-48277: [C++][Parquet] unpack with shuffle algorithm [arrow]

2026-02-09 Thread via GitHub
AntoinePrv commented on code in PR #47994: URL: https://github.com/apache/arrow/pull/47994#discussion_r2783198073 ## cpp/src/arrow/util/bpacking_simd_kernel_internal.h: ## @@ -0,0 +1,790 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor li

[PR] Update rand requirement from 0.9 to 0.10 [arrow-rs]

2026-02-09 Thread via GitHub
dependabot[bot] opened a new pull request, #9382: URL: https://github.com/apache/arrow-rs/pull/9382 Updates the requirements on [rand](https://github.com/rust-random/rand) to permit the latest version. Changelog Sourced from https://github.com/rust-random/rand/blob/master/CHANGELOG

Re: [PR] Orasort based sort kernels [arrow-rs]

2026-02-09 Thread via GitHub
alamb commented on PR #9300: URL: https://github.com/apache/arrow-rs/pull/9300#issuecomment-3872460083 > I don't understand the benchmarks, can someone explain it to me? I see 1.23 on main and 1 on this branch. In both runs main looks like having more than 1. What am I missing? It se

Re: [PR] GH-49114: [C++][Parquet] Fix converting schema failure with deep nested two-level encoding list structure [arrow]

2026-02-09 Thread via GitHub
wgtmac commented on code in PR #49125: URL: https://github.com/apache/arrow/pull/49125#discussion_r2783197621 ## cpp/src/parquet/arrow/schema.cc: ## @@ -620,7 +621,8 @@ Status MapToSchemaField(const GroupNode& group, LevelInfo current_levels, if (group.field_count() != 1) {

Re: [PR] GH-47195: [Python][CI] Add support for building PyArrow library on Windows ARM64 [arrow]

2026-02-09 Thread via GitHub
MugundanMCW commented on PR #48539: URL: https://github.com/apache/arrow/pull/48539#issuecomment-3872522265 Hi @raulcd, I pushed a few more changes to fix the failing Windows ARM64 jobs and got a successful run on my fork. Could you trigger the CI again? -- This is an automated message fro

[PR] Gpatterson/issue 15 interval mdn tests [arrow-js]

2026-02-09 Thread via GitHub
GeorgeLeePatterson opened a new pull request, #379: URL: https://github.com/apache/arrow-js/pull/379 ## Summary - fix `IntervalMonthDayNano` conversion to avoid `BigInt(number)` precision traps for unsafe integers - add regression coverage in `test/unit/vector/interval-month-day
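
For readers unfamiliar with the `BigInt(number)` trap mentioned in the summary above, here is a minimal sketch of the general problem. It is not taken from the PR, whose actual fix is not visible in this truncated snippet. The helper name `toBigIntChecked` is hypothetical. A JavaScript number is a 64-bit float, so integers beyond `Number.MAX_SAFE_INTEGER` may already be rounded before `BigInt()` ever sees them.

```typescript
// Hypothetical helper (an assumption for illustration, not the PR's code):
// convert a number to bigint only when the number is an exact integer.
function toBigIntChecked(value: number): bigint {
  if (!Number.isSafeInteger(value)) {
    throw new RangeError(`${value} is not a safe integer; pass a bigint or a string instead`);
  }
  return BigInt(value);
}

// 2 ** 53 + 1 cannot be represented as a double, so it is rounded
// before BigInt() receives it.
const fromNumber: bigint = BigInt(2 ** 53 + 1);

// Parsing from a string keeps every digit.
const fromString: bigint = BigInt("9007199254740993");

// True when the number path silently lost precision.
const lossy: boolean = fromNumber !== fromString;

console.log(fromNumber.toString(), fromString.toString(), lossy);
```

Rejecting unsafe integers up front, as the hypothetical helper does, is only one possible design. Accepting `bigint` or string inputs directly would avoid the rounding entirely.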
