issues
Messages by Thread
[I] `Table.group_by(...).aggregate([("flag", "any")])` returns incorrect `True` on a sliced Boolean array with nulls [arrow]
via GitHub
[I] [Python][CI] test_string_to_tzinfo_pytz_fallback fails on verify-rc-source-windows [arrow]
via GitHub
[I] arrow/scalar: *scalar.Extension does not implement Release()/Retain(), leaking storage through compute.ScalarDatum.Release() [arrow-go]
via GitHub
[I] [Python][CI] test_table_uses_memory_pool flaky on macOS 14 job [arrow]
via GitHub
Re: [I] [Python][Parquet] read_schema drops extension types (UUID returned as fixed_size_binary[16]) [arrow]
via GitHub
Re: [I] [C++] Allow scanner to assert an ordering and/or support implicit ordering [arrow]
via GitHub
Re: [I] [Parquet][Python] API to decrypt parquet file using one DEK and no metadata [arrow]
via GitHub
[I] [Python] Expose Expression.field_refs() to enumerate referenced fields [arrow]
via GitHub
[I] Add `arrow.range` canonical extension type for bounded ranges [arrow]
via GitHub
Re: [I] [C++][Acero] record_batch_reader_source does not support `select * limit 3` [arrow]
via GitHub
Re: [I] r/adbcsnowflake: Snowflake driver logs at info level causing CMD check fail [arrow-adbc]
via GitHub
[I] parquet/file: WriteBatchSpaced panics escape the API and silently discards commit-write errors [arrow-go]
via GitHub
[I] [C++][Parquet] SIMD-accelerate the SBBF probe in BlockSplitBloomFilter::FindHash [arrow]
via GitHub
Re: [I] [Docs][CI] Enable version switcher during local and PR preview builds [arrow]
via GitHub
Re: [I] [C++] Pivot Support in Acero [arrow]
via GitHub
[I] [Dev] Enable auto GitHub Copilot review [arrow]
via GitHub
Re: [I] [Dev] Enable auto GitHub Copilot review [arrow]
via GitHub
Re: [I] GeoArrow and GeoParquet [arrow-go]
via GitHub
Re: [I] go/adbc/driver/snowflake: NUMBER(n, 0) values get truncated [arrow-adbc]
via GitHub
[I] [CI][C++]: Resolve the macos-cran nightly failures [arrow]
via GitHub
[I] The "copylocks" warning is present in the file "numeric_generic.go". [arrow-go]
via GitHub
Re: [I] The "copylocks" warning is present in the file "numeric_generic.go". [arrow-go]
via GitHub
[I] [CI][Release] Windows verification jobs to set conda environment [arrow]
via GitHub
Re: [I] [CI][Release] Windows verification jobs to set conda environment [arrow]
via GitHub
Re: [I] go/adbc/driver/snowflake: missing xdbc_column_size for binary columns in GetObjects [arrow-adbc]
via GitHub
Re: [I] Support arrow list and large_list dtypes when ingesting to snowflake [arrow-adbc]
via GitHub
[I] [R] Import of S3 methods from bit64 [arrow]
via GitHub
Re: [I] [R] Import of S3 methods from bit64 [arrow]
via GitHub
Re: [I] [C++] Allow more Flatbuffers versions to compile Arrow [arrow]
via GitHub
Re: [I] [C++] Error linking the util/cancel.h [arrow]
via GitHub
[I] [Python] Table.from_pylist on ExtensionType column with list_ storage crashes when values exceed int32 offsets [arrow]
via GitHub
[I] [C++][Parquet] Undefined behavior in `TypedColumnWriterImpl::UpdateLevelHistogram` [arrow]
via GitHub
Re: [I] [C++][Parquet] Undefined behavior in `TypedColumnWriterImpl::UpdateLevelHistogram` [arrow]
via GitHub
[I] R - FinalizeS3 segfault [arrow]
via GitHub
[I] [C++][Parquet] Add bloom filter folding to automatically size SBBF filters [arrow]
via GitHub
Re: [I] [C++][FlightRPC] ODBC macOS `.pkg` Installer [arrow]
via GitHub
[I] [C++] Use FetchContent for RapidJSON [arrow]
via GitHub
Re: [I] [C++] Use FetchContent for RapidJSON [arrow]
via GitHub
Re: [I] [C++] Address "Compatibility with CMake < 3.5 has been removed" error [arrow]
via GitHub
Re: [I] Python Snowflake Driver has incorrect documentation of adbc.rpc.result_queue_size [arrow-adbc]
via GitHub
[I] [CI][Python] Revert pinning miniforge once mamba solver issue is resolved [arrow]
via GitHub
Re: [I] snowflake: `adbc_ingest` will fail with "double free" segmentation fault if record batch schema is incorrect [arrow-adbc]
via GitHub
[I] [C++] Some builds fail to build due to gRPC failures [arrow]
via GitHub
Re: [I] [C++] Some builds fail to build due to gRPC failures [arrow]
via GitHub
[I] [C++][FlightRPC] <grpcpp/version_info.h> not found [arrow]
via GitHub
Re: [I] [C++][FlightRPC] <grpcpp/version_info.h> not found [arrow]
via GitHub
[I] [R] Support for Tensor class [arrow]
via GitHub
Re: [I] [R] Support for Tensor class [arrow]
via GitHub
[I] [R] open_dataset with root directory inaccessible? [arrow]
via GitHub
Re: [I] [R] open_dataset with root directory inaccessible? [arrow]
via GitHub
[I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
Re: [I] [C++] Implement HTTP and FTP file systems [arrow]
via GitHub
[I] csharp: driver manager incorrectly loads and validates manifests [arrow-adbc]
via GitHub
Re: [I] csharp: driver manager incorrectly loads and validates manifests [arrow-adbc]
via GitHub
[I] [CI][Python] AMD64 Conda Python 3.10 Pandas 1.3.4 job consistently timing out [arrow]
via GitHub
Re: [I] [CI][Python] AMD64 Conda Python 3.10 Pandas 1.3.4 job consistently timing out [arrow]
via GitHub
[I] csharp: literal strings not supported by driver manager toml parser [arrow-adbc]
via GitHub
Re: [I] csharp: literal strings not supported by driver manager toml parser [arrow-adbc]
via GitHub
[I] csharp: driver manager missing mac search path [arrow-adbc]
via GitHub
Re: [I] csharp: driver manager missing mac search path [arrow-adbc]
via GitHub
[I] [R][Wasm] Fix Error: thread constructor failed: Not supported under Wasm [arrow]
via GitHub
[I] [CI] Drop obsolete [email protected] brew uninstall from cpp.yml and python.yml [arrow]
via GitHub
Re: [I] [CI] Drop obsolete [email protected] brew uninstall from cpp.yml and python.yml [arrow]
via GitHub
Re: [I] [C#] BitUtility.cs performance improvement [arrow]
via GitHub
[I] BitUtility.cs performance enhancement [arrow-dotnet]
via GitHub
[I] Fix Unity build ordering issue [arrow]
via GitHub
Re: [I] [C++][FlightRPC] Fix Unity build ordering issue [arrow]
via GitHub
[I] parquet/file: NewParquetWriter panics on transient sink.Write errors during file initialization [arrow-go]
via GitHub
Re: [I] pyarrow tranport Tensor type data to java arrow flight server [arrow]
via GitHub
Re: [I] pyarrow tranport Tensor type data to java arrow flight server [arrow]
via GitHub
[I] [Release][Packaging] Add Reproducible Builds for RPM based packages [arrow]
via GitHub
[I] [Release][Packaging] Add Reproducible build for Debian Packages [arrow]
via GitHub
Re: [I] [Release][Packaging] Add Reproducible build for Debian Packages [arrow]
via GitHub
[I] [CI][Packaging] Use random build directory path for Debian Packages instead of fixed one [arrow]
via GitHub
[I] [Java] DictionaryEncoder doesn't crash when decoding index outside of Dictionary [arrow-java]
via GitHub
[I] Integration tests failing with Rust producing and .NET consuming binary views [arrow-dotnet]
via GitHub
Re: [I] Integration tests failing with Rust producing and .NET consuming binary views [arrow-dotnet]
via GitHub
[I] [C++][Gandiva] Duplicate function aliases with same parameters [arrow]
via GitHub
[I] [postgresql] have a way of telling the driver to avoid `ROLLBACK AND CHAIN` [arrow-adbc]
via GitHub
Re: [I] unsupported cast to string_view from utf8 in v18 [arrow-go]
via GitHub
[I] c/driver/postgresql: adbc_ingest silently misaligns list/large_list/fixed_size_list rows when the source Arrow array is sliced (parent.offset > 0) [arrow-adbc]
via GitHub
[I] [Docs][C++][Parquet] Add API reference [arrow]
via GitHub
[I] [R][Packaging] Support building the R package under Emscripten [arrow]
via GitHub
Re: [I] [R][Packaging] Support building the R package under Emscripten [arrow]
via GitHub
[I] docker-amd64-ubuntu-memcheck verify job is failing [arrow-nanoarrow]
via GitHub
[I] New warning on gcc16 [arrow-nanoarrow]
via GitHub
Re: [I] New warning on gcc16 [arrow-nanoarrow]
via GitHub
Re: [I] [C++] Output batch size control in ExecPlan [arrow]
via GitHub
[I] [C++][Gandiva] Add 2 arg REGEXP_EXTRACT function [arrow]
via GitHub
[I] Support `expr.IntervalYearToMonthLiteral` in `literalToDatum` [arrow-go]
via GitHub
Re: [I] [C++] Improve error handling for hash table merges [arrow]
via GitHub
[I] [Java] Improve VectorSchemaRoot.getVector(String name) lookup performance [arrow-java]
via GitHub
[I] Explicitly providing CMAKE_LIBTOOL does not work on MacOS [arrow]
via GitHub
Re: [I] [C++] Explicitly providing CMAKE_LIBTOOL does not work on MacOS [arrow]
via GitHub
Re: [I] [C++] DictionaryArray::dictionary() is not thread safe [arrow]
via GitHub
Re: [I] [Document] Why int32() offset type is used for DenseUnionArray? [arrow]
via GitHub
Re: [I] [C++] CSV reader: Ability to not infer column types. [arrow]
via GitHub
[I] Fix remaining overflow and negative length handling issues in Gandiva string functions [arrow]
via GitHub
[I] Azure with SAS Keys [arrow]
via GitHub
[I] [GLib] Enable tests for custom extension data type [arrow]
via GitHub
[I] [Python][CI] Raise oldest NumPy wheel-test requirement to a patched release [arrow]
via GitHub
Re: [I] [Python][CI] Raise oldest NumPy wheel-test requirement to a patched release [arrow]
via GitHub
[I] [C++] IPC file fuzzer fails when footer schema has differing endianness [arrow]
via GitHub
[I] Question regarding Parquet Page Index: Why enable it during write if it's not utilized during read? [arrow-go]
via GitHub
Re: [I] [C++][Acero] Window Functions add helper classes for window aggregates and distinct aggregates [arrow]
via GitHub
Re: [I] [C++][Acero] Window Functions add helper classes for quantiles [arrow]
via GitHub
Re: [I] [C++][Acero] Add Window Functions exec node [arrow]
via GitHub
Re: [I] [C++/Python] Add support for S3 Bucket Versioning [arrow]
via GitHub
[I] [Avro] hamba/avro is abandoned [arrow-go]
via GitHub
[I] [Python] Improve Extension Types Support in PyArrow (umbrella issue) [arrow]
via GitHub
Re: [I] [Python] Subclassing the PyExtensionType and getting it's bit_width attribute returns Non-fixed width type ValueError [arrow]
via GitHub
[I] The annotation is incorrect. It should be 1M. [arrow-go]
via GitHub
Re: [I] The annotation is incorrect. It should be 1M. [arrow-go]
via GitHub
[I] [C++][Parquet] Avoid unbounded temp alloc in BYTE_STREAM_SPLIT decoder [arrow]
via GitHub
Re: [I] [C++] Support optional arguments in aggregation function mapping in the Substrait consumer. [arrow]
via GitHub
Re: [I] [R] Differing results in log bindings [arrow]
via GitHub
Re: [I] [Python][Dev] Document the process to run numpydoc checks [arrow]
via GitHub
Re: [I] [R] Implement asof join [arrow]
via GitHub
Re: [I] Clean up how the CSV reader handles the first buffer [arrow]
via GitHub
Re: [I] [R] Tidy up the pkgdown articles site index [arrow]
via GitHub
Re: [I] [R] arrow_eval: do we need both nse_funcs and .cache$functions? [arrow]
via GitHub
Re: [I] [C++] [Python] Major performance improvements to CSV reading from S3 [arrow]
via GitHub
Re: [I] [R] Table viewer for knitr/notebooks [arrow]
via GitHub
[I] [C++][Dataset] std::bad_weak_ptr in multi-threaded writer tests on MinGW gcc-16 [arrow]
via GitHub
Re: [I] [C++][CI] MinGW GCC 16.1 regression - shared_ptr corruption in multi-threaded tests [arrow]
via GitHub
[I] Managing ownership in VectorSchemaRoot#addVector, recent changes miss the main fault. [arrow-java]
via GitHub
Re: [I] [R] [Docs] Improve (or really actually document) our Python bridge documentation [arrow]
via GitHub
Re: [I] [C++] Fetch Node Substrait Integration [arrow]
via GitHub
[I] [C++][Parquet] Reading dictionary encoded boolean throws NYI [arrow]
via GitHub
Re: [I] [C++] Substarit End-To-End Tests for Relations [arrow]
via GitHub
Re: [I] [R] Allow unrecognized R expressions to be callable as compute::Functions [arrow]
via GitHub
Re: [I] [R] Add vignette on ExecPlans and how they work [arrow]
via GitHub
Re: [I] [Python] Memory kept after del and pool.released_unused() [arrow]
via GitHub
Re: [I] Does arrow support access S3 based on 'path-style'? [arrow]
via GitHub
Re: [I] [C++] RecordBatch Make() with Arrow Arrays could infer length [arrow]
via GitHub
Re: [I] [C++][Parquet] Support nested data conversions for chunked array [arrow]
via GitHub
[I] [GLib] garrow_data_type_new_raw segfaults on arrow::extension::OpaqueType and any non-GLib ExtensionType (ADBC PostgreSQL NUMERIC) [arrow]
via GitHub
Re: [I] [GLib] garrow_data_type_new_raw segfaults on arrow::extension::OpaqueType and any non-GLib ExtensionType (ADBC PostgreSQL NUMERIC) [arrow]
via GitHub
[I] [C++] Uncontrolled Memory Allocation (OOM) in Parquet Delta decoders [arrow]
via GitHub
[I] [C++][Gandiva] Use timegm in date_time_test utilities to avoid DST-dependent behavior [arrow]
via GitHub
Re: [I] [Python] `compute.count_distinct` not implemented for `extension<arrow.uuid>` and `extension<arrow.json>` [arrow]
via GitHub
Re: [I] [Python] `compute.min_max` is not implemented for `extension<arrow.json>` [arrow]
via GitHub
[I] [Bug] NewIntXStatistics factories unconditionally set hasDistinctCount=true, causing distinct_count=0 to always appear in Parquet output [arrow-go]
via GitHub
Re: [I] [Bug] NewIntXStatistics factories unconditionally set hasDistinctCount=true, causing distinct_count=0 to always appear in Parquet output [arrow-go]
via GitHub
[I] [C++] HeadBucket called in S3FS breaking IAM scoped prefixes [arrow]
via GitHub
Re: [I] [R] Implement typeof() in Arrow dplyr queries [arrow]
via GitHub
Re: [I] [R] Implement as.integer and as.numeric for timestamp types etc. in Arrow dplyr queries [arrow]
via GitHub
Re: [I] [R]: Lack of `assume_timezone` binding [arrow]
via GitHub
Re: [I] [C++] Move Parquet APIs to use Result instead of Status [arrow]
via GitHub
Re: [I] [C++][Python][Doc] Document that order is not preserved when writing dataset with use_threads=True [arrow]
via GitHub
Re: [I] [C++][Python] SEGFAULT when casting FixedSizeTensorArray to storage type then back to FixedSizeTensorArray [arrow]
via GitHub
Re: [I] [Python] ParquetWriter use_compliant_nested_type=True does not preserve ExtensionArray when reading back [arrow]
via GitHub
Re: [I] [Python] `pyarrow.Table.to_pandas` creates Index instead of PeriodIndex [arrow]
via GitHub
[I] [C++][CI] gcc-16 MinGW failures - remaining fixes (follow-up to #49930) [arrow]
via GitHub
Re: [I] [C++][CI] gcc-16 MinGW failures - remaining fixes (follow-up to #49930) [arrow]
via GitHub
[I] [Format] Better document IPC file and stream equivalence [arrow]
via GitHub
[I] [C++] Provide a default implementation for ExtensionType::ExtensionEquals [arrow]
via GitHub
Re: [I] Enhancement Request: Custom Operator Support for PyArrow Extension Types in Compute Functions [arrow]
via GitHub
Re: [I] [Python] For extension types, compute kernels should default to storage types? [arrow]
via GitHub
Re: [I] [Python] Enhance logical operator support and truth value handling for PyArrow Arrays [arrow]
via GitHub
Re: [I] [C++] The difference between namespace detail and internal [arrow]
via GitHub
Re: [I] [C++] Add support for %Z to strptime [arrow]
via GitHub
Re: [I] [C++][Compute] Add Cryptographic hash functions to Acero [arrow]
via GitHub
Re: [I] [C++] [Python] Tag record batches with start_byte and end_byte infromation [arrow]
via GitHub
Re: [I] [R] Update binding for add_filename() NSE function to error if used on Table [arrow]
via GitHub
Re: [I] [C++] Disable anonymous namespaces in debug mode [arrow]
via GitHub
Re: [I] [R] Additional dplyr functionality [arrow]
via GitHub
[I] Power BI - using UTF8_LCASE column returns error Unable to understand the type for column [arrow-adbc]
via GitHub
Re: [I] Power BI - using UTF8_LCASE column returns error Unable to understand the type for column [arrow-adbc]
via GitHub
Re: [I] [Python] Dataset.to_batches() / ParquetFileFragment.to_batches() hang forever [arrow]
via GitHub
[I] [Go] RecordFromJSON does not handle integer values not representable by double [arrow-go]
via GitHub
Re: [I] [Go] RecordFromJSON does not handle integer values not representable by double [arrow-go]
via GitHub
Re: [I] [Go] RecordFromJSON does not handle integer values not representable by double [arrow-go]
via GitHub
[I] [FlightSQL] SQLite example pulls GPL-licensed modernc.org/ccorpus into all consumers' go.sum [arrow-go]
via GitHub
Re: [I] [FlightSQL] SQLite example pulls GPL-licensed modernc.org/ccorpus into all consumers' go.sum [arrow-go]
via GitHub
[I] [Python] Protect PyBuffer and NumPyBuffer destructors against interpreter finalization [arrow]
via GitHub
Re: [I] [Python] Protect PyBuffer and NumPyBuffer destructors against interpreter finalization [arrow]
via GitHub
[I] [R] Update macOS CRAN job SDK from 11.3 to 14.5 to match R 4.6.0 build environment [arrow]
via GitHub
Re: [I] [C++] Bump versions of bundled dependencies [arrow]
via GitHub
[I] [C++] Bump bundled c-ares [arrow]
via GitHub
Re: [I] [C++] Bump bundled c-ares [arrow]
via GitHub
Re: [I] [Python] Interchange object data buffer has the wrong dtype / `from_dataframe` incorrect [arrow]
via GitHub
Re: [I] [Python][Interchange protocol] Export boolean columns as bit-packed values [arrow]
via GitHub
Re: [I] [Python] DataFrame interchange protocol: NaNs are interchanged as null [arrow]
via GitHub
Re: [I] [Python] Cannot create RecordBatch with nested struct containing extension type [arrow]
via GitHub
Re: [I] [Parquet][Python] parquet arrow schema inconsistent for file with UUID [arrow]
via GitHub
Re: [I] [Python] Instantiating arrays with type ListType[ExtensionType] is not supported [arrow]
via GitHub
Re: [I] Extension types not fully supported in list arrays [arrow]
via GitHub
Re: [I] [C++][Compute] Support to initialize expression with a string [arrow]
via GitHub
Re: [I] [C++] Add hash_mode function [arrow]
via GitHub
Re: [I] [C++] Implement the round-shift for fixed size data type [arrow]
via GitHub
Re: [I] [C++][Docs] Describe limitations and alternatives for handling dependencies via package managers [arrow]
via GitHub
Re: [I] [Docs][C++] Add missing methods to ArrayBuilders API Reference [arrow]
via GitHub
Re: [I] [C++] Add Byte Range to CSV Reader ReadOptions [arrow]
via GitHub
Re: [I] [Python] Error using extension types in struct in PyArrow [arrow]
via GitHub
[I] BUG: Pandas BUG: DataFrame.fillna() with ArrowDtype(pa.null()) columns causes Arrow C++ assertion failure (core dump) [arrow]
via GitHub
Re: [I] [Python] BUG: Pandas BUG: DataFrame.fillna() with ArrowDtype(pa.null()) columns causes Arrow C++ assertion failure (core dump) [arrow]
via GitHub
Re: [I] [Python] Accessing parquet files with parquet.read_table in google cloud storage fails, but works with dataset, works in 16.1.0 fails in 17.0.0 [arrow]
via GitHub