issues
Messages by Thread
Re: [I] [C++] Add Compute Kernel for Casting from struct to string [arrow]
via GitHub
Re: [I] [C++] Add support for nested types in StringFormatter [arrow]
via GitHub
Re: [I] [C++][Parquet][Benchmark] Adding benchmarking for reading Statistics [arrow]
via GitHub
Re: [I] [C++] Asof-joins inefficiently copy the left hand side [arrow]
via GitHub
Re: [I] [C#] Add initial GObject based Dataset library [arrow]
via GitHub
Re: [I] [CI][Python] Enable pycodestyle check for Cython related files [arrow]
via GitHub
Re: [I] [C++] fine-grained span for hash join node [arrow]
via GitHub
Re: [I] [Python][Docs] `RecordBatch.from_pydict()` and `RecordBatch.from_pylist()` example code references `pa.Table` instead of `pa.RecordBatch` [arrow]
via GitHub
Re: [I] [Python][Docs] Improve cross-references on pyarrow write_to_dataset docs [arrow]
via GitHub
Re: [I] [Python] failed to concat_tables with nested struct [arrow]
via GitHub
Re: [I] Function 'add_checked' has no kernel matching input types (bool, bool) [arrow]
via GitHub
Re: [I] pyarrow.parquet.write_table compression arg expects a column name suffix sometimes [arrow]
via GitHub
Re: [I] [C++] Rename the fixed-width concept from fixed_width_internal.h to "generalized fixed-width" [arrow]
via GitHub
Re: [I] Add a metadata object to pyarrow.RecordBatchReader [arrow]
via GitHub
Re: [I] [C++][Compute] ScalarExecutor and VectorExecutor's can be improved by support more precise pre-allocation and more pre-allocated types [arrow]
via GitHub
Re: [I] [Python] Provide a way to specify the maximum chunk size to `pyarrow.flight.RecordBatchStream` [arrow]
via GitHub
Re: [I] [C++] Don't use VisitTypeInline in DictionaryArray::Compact() [arrow]
via GitHub
Re: [I] [C++] Equals method on CudaBuffer [arrow]
via GitHub
Re: [I] [Python] Add max_threads parameter to Dataset methods [arrow]
via GitHub
Re: [I] [C++] Make the set of casts and hash kernels involving float16 consistent with other floating types [arrow]
via GitHub
Re: [I] [Python] Dataset sorting_columns support request [arrow]
via GitHub
Re: [I] [C++] Add more efficient kernels for list_slice when input and output are list-views [arrow]
via GitHub
Re: [I] [C++][Arrow][Parquet] Are there any plans to support decimal NaN? [arrow]
via GitHub
Re: [I] [pyarrow] support for casting from python's ctypes (or directly from byte objects) [arrow]
via GitHub
Re: [I] [C++] Implement cast suggestions for all nested types in Concatenate [arrow]
via GitHub
Re: [I] [CI][R] Add CI jobs for windows aarch64 [arrow]
via GitHub
Re: [I] [Python] `pa.Table.from_pylist` support list of tuples? [arrow]
via GitHub
Re: [I] [C++] Vendor googletest [arrow]
via GitHub
Re: [I] [CI][Go][Java][JavaScript] Add new CI to ARM64-based macOS [arrow]
via GitHub
Re: [I] [C++] Support casting between list-view and list-like types [arrow]
via GitHub
Re: [I] Feature: Implement uint64 type inference pyarrow, for pa.Table.from_py* [arrow]
via GitHub
Re: [I] [C++] Copy with compaction to a different device [arrow]
via GitHub
Re: [I] [R] Speed up `nrow()` on filtered dataset [arrow]
via GitHub
Re: [I] [C++][Acero] SourceNode ToString should print *something* useful [arrow]
via GitHub
Re: [I] [CI] Attach lint failures to PR diff view [arrow]
via GitHub
Re: [I] [C++][Parquet] BloomFilter writer: Estimate the bloom filter quality [arrow]
via GitHub
Re: [I] Python / PyArrow: pyarrow.feather support for streaming files [arrow]
via GitHub
Re: [I] Proposal: generic streaming protocol for columnar data [arrow]
via GitHub
Re: [I] [C++][Parquet] Parquet Arrow: Better interface for coerce_types handling [arrow]
via GitHub
Re: [I] [Python] Extend PyCapsule interface with support for KeyValueMetadata object [arrow]
via GitHub
Re: [I] Allow non-async server to cancel in-progress Flight RPC calls [arrow]
via GitHub
Re: [I] [Python] Command line execution of PyArrow and submodules [arrow]
via GitHub
Re: [I] [IPC] Concatenate array chunks while transferring Arrow arrays over the wire [arrow]
via GitHub
Re: [I] [C++] IO: InputStream::Advance will always read from Stream [arrow]
via GitHub
Re: [I] [Python] Current assertion of CPU-accessible data in Array methods is specific to CPU device type [arrow]
via GitHub
Re: [I] Allow `columns` to be `logically renamed` in pyarrow dataset on fragment level [arrow]
via GitHub
Re: [I] [Integration] Extend integration tests to allow implementations to handle only file or stream [arrow]
via GitHub
Re: [I] Add with_value_type to ListType [arrow]
via GitHub
Re: [I] [Python] Add bindings for additional Buffer class non-CPU methods [arrow]
via GitHub
Re: [I] [C++] Compute: RowEncoder eliminates offsets when all columns are fixed-sized [arrow]
via GitHub
Re: [I] [C++][Python][R] Add cpplint pre-commit checks to R and Python C++ code [arrow]
via GitHub
Re: [I] [C++][Python] Add Python wrapper for JSON canonical extension type [arrow]
via GitHub
Re: [I] [Python] Add row indices in parquet fragment to include/exclude [arrow]
via GitHub
Re: [I] [Python] Dedicated flavor value for `DirectoryPartitioning` [arrow]
via GitHub
Re: [I] [C++] Compute: RowEncoder supports a unsafe_encoded_row to get the std::string_view for row [arrow]
via GitHub
Re: [I] [Docs][HTTP] Recommend method(s) for resuming interrupted uploads? [arrow]
via GitHub
Re: [I] [C++][Compute] Provide date extracting functions dedicated for PostgreSQL time data type [arrow]
via GitHub
Re: [I] [R] Let `na.rm` of `mean()` support removing `NaN` as in base R [arrow]
via GitHub
Re: [I] [C++] DecimalRealConversion could multiply by 5 instead of 10 [arrow]
via GitHub
Re: [I] [C++][Python] Allow Schema unification when concatenating RecordBatch [arrow]
via GitHub
Re: [I] [C++] Improve Decimal32 conversion from Real [arrow]
via GitHub
Re: [I] [C++][FS][Azure] Improve error handling [arrow]
via GitHub
Re: [I] [Python][Docs] Add new examples to CUDA integration docs [arrow]
via GitHub
Re: [I] [Python] Missing test cases in all_array_types [arrow]
via GitHub
Re: [I] [C++] Use S3 conditional writes to avoid some explicit checks [arrow]
via GitHub
Re: [I] [C++][Flight] Add support for passing pointer options down to transports [arrow]
via GitHub
Re: [I] [Python][Parquet] Read encrypted parquet datasets via _metadata [arrow]
via GitHub
Re: [I] Add support for struct type in hash_list aggregation [arrow]
via GitHub
Re: [I] Improving the pa.Table.from_pylist method [arrow]
via GitHub
Re: [I] `join`ing tables with ExtensionArrays [arrow]
via GitHub
Re: [I] Create testing functions for PyArrow.Table comparisons like `pandas.testing.assert_frame_equals()` [arrow]
via GitHub
Re: [I] [C++][CI] Add a job to exercise cross-compiling to our nightly CI [arrow]
via GitHub
Re: [I] [Python] Add better error reporting for missing items on classpath for HadoopFileSystem [arrow]
via GitHub
Re: [I] [GLib][FlightRPC] Test sometimes crashed [arrow]
via GitHub
Re: [I] [C++] Use of Standard C++ callable in Filtering Dataset [arrow]
via GitHub
Re: [I] [C++] Improve performance of sequential access of ChunkResolver [arrow]
via GitHub
Re: [I] [Python] Add support for chown and chmod in HadoopFileSystem [arrow]
via GitHub
Re: [I] [Integration] Add integration test case for map type with extension key and/or value [arrow]
via GitHub
Re: [I] [Python] from_numpy_dtype-like mapping for Python types [arrow]
via GitHub
Re: [I] [C++][Parquet] Tools: uses base64 for FLBA/Binary type min-max to avoid non-utf8 string [arrow]
via GitHub
Re: [I] [C++] dictionary array transpose not handle null in kernel function `TransposeInts` [arrow]
via GitHub
Re: [I] [CI] Use micromamba instead of miniforge3 [arrow]
via GitHub
Re: [I] [Python] Support lists of sources and destinations on `pyarrow.fs.copy_files()` [arrow]
via GitHub
Re: [I] [C++] Minor enhance the doc and impl of DictionaryArray::Transpose [arrow]
via GitHub
[I] [Python][Packaging] Wrong ARROW_SIMD_LEVEL=SSE4_2 on arm64 macOS wheels [arrow]
via GitHub
[I] Check that YMM register saving is enabled before using AVX at runtime [arrow]
via GitHub
Re: [I] Check that YMM register saving is enabled before using AVX at runtime [arrow]
via GitHub
[I] [R] Update NEWS.md for 23.0.1 [arrow]
via GitHub
Re: [I] [R] Update NEWS.md for 23.0.1 [arrow]
via GitHub
Re: [I] [C++][Dataset] Simplify ScanOptions after complexity has moved to ScanNode [arrow]
via GitHub
Re: [I] Cython API does not mirror all C++ functions [arrow]
via GitHub
Re: [I] [C++] Implement hash_aggregate kernels (umbrella issue) [arrow]
via GitHub
Re: [I] [C++] Should NaN comparison return false or NaN/NA? [arrow]
via GitHub
Re: [I] [C++] Clean up docs for log/trig functions, MakeUnaryArithmeticFunctionFloatingPoint [arrow]
via GitHub
Re: [I] [C++][Python] Revisit compute functions names [arrow]
via GitHub
[I] [R] CRAN packaging checklist for version 23.0.1 [arrow]
via GitHub
Re: [I] [R] CRAN packaging checklist for version 23.0.1 [arrow]
via GitHub
Re: [I] [C++] Add systematic tests for Builder::AppendArraySlice [arrow]
via GitHub
[I] [Python] Add a PyArrow sanitizers build [arrow]
via GitHub
[I] [Dev][CI] Email reports are not being sent due to network issues sending emails [arrow]
via GitHub
Re: [I] [Dev][CI] Email reports are not being sent due to network issues sending emails [arrow]
via GitHub
Re: [I] java/driver/flight-sql: getObjects needs to include empty catalogs/schemas [arrow-adbc]
via GitHub
[I] [Ruby] Ensure using C++20 [arrow]
via GitHub
Re: [I] [Ruby] Ensure using C++20 [arrow]
via GitHub
[I] [Ruby] Add support for auto dependency install for red-arrow on macOS [arrow]
via GitHub
Re: [I] [Ruby] Add support for auto dependency install for red-arrow on macOS [arrow]
via GitHub
[I] [CI][Packaging][deb] Add support for minor release in `dev/release/verify-apt.sh` [arrow]
via GitHub
Re: [I] [CI][Packaging][deb] Add support for minor/patch release in `dev/release/verify-apt.sh` [arrow]
via GitHub
Re: [I] [JS] Fix instanceof or move away from instanceof within arrow-js [arrow-js]
via GitHub
[I] [C++][CI] Use differential fuzzing on IPC file fuzzer [arrow]
via GitHub
Re: [I] [C++][CI] Use differential fuzzing on IPC file fuzzer [arrow]
via GitHub
[I] NoSuchMethodError with gRPC Java 1.79.0 [arrow-java]
via GitHub
[I] Segmentation fault in pyarrow.compute.if_else: arguments have string type but the result only fits in large_string type [arrow]
via GitHub
[I] Table.to_batches() loses schema information when table has zero rows [arrow]
via GitHub
[I] [Benchmarks] Install R from conda again [arrow]
via GitHub
Re: [I] [Benchmarks] Revert rig-based R installation in benchmark hooks [arrow]
via GitHub
[I] [Python] Expose RecordBatchFileReader::CountRows in Python [arrow]
via GitHub
Re: [I] `ListViewVector#getElementEndIndex` Returns Size not Index [arrow-java]
via GitHub
Re: [I] [Java] Checker Framework annotations for AutoCloseables in arrow-memory-core are incorrect [arrow-java]
via GitHub
Re: [I] [C++][Gandiva] Add format_number hive function to gandiva [arrow]
via GitHub
Re: [I] [C++][Doc] Fix warnings generated by sphinx when incorporating doxygen docs [arrow]
via GitHub
Re: [I] [C++] Implement hash_aggregate mode kernel [arrow]
via GitHub
Re: [I] [C++] Implement hash_aggregate exact quantile kernel [arrow]
via GitHub
Re: [I] [C++] Simplify Call("case_when", ...) expressions [arrow]
via GitHub
Re: [I] BaseListBuilder constructor should check the provided type is a list [arrow]
via GitHub
Re: [I] [Packaging] Should we maintain the Arch linux repository? [arrow]
via GitHub
[I] format: provide way to escape wildcards in GetObjects [arrow-adbc]
via GitHub
[I] format: expose data types in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose more catalog/schema/table/column properties [arrow-adbc]
via GitHub
[I] [Go] Implement GetStatistics [arrow-adbc]
via GitHub
Re: [I] [Go] Implement GetStatistics [arrow-adbc]
via GitHub
Re: [I] [Go] Implement GetStatistics [arrow-adbc]
via GitHub
[I] [Go] Provide more detail on table types in GetObjects [arrow-adbc]
via GitHub
Re: [I] [Go] Provide more detail on table types in GetObjects [arrow-adbc]
via GitHub
[I] format: expose partition info in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose sequences in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose triggers in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose more metadata about foreign keys [arrow-adbc]
via GitHub
[I] format: expose referenced columns in catalog metadata [arrow-adbc]
via GitHub
Re: [I] format: expose referenced columns in catalog metadata [arrow-adbc]
via GitHub
[I] format: add `constraint_expression` field to `CONSTRAINT_SCHEMA` [arrow-adbc]
via GitHub
[I] c/driver/postgresql: return exclusion constraints [arrow-adbc]
via GitHub
[I] format: expose indexes in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose table/view definitions in catalog metadata [arrow-adbc]
via GitHub
[I] format: expose functions/procedures in catalog metadata [arrow-adbc]
via GitHub
[I] [Python][Docs] Incorrect parameter descriptions in SparseCSCMatrix.from_numpy [arrow]
via GitHub
[I] [C++][Parquet] Integer overflow in Parquet dict decoding [arrow]
via GitHub
Re: [I] [C++][Parquet] Integer overflow in Parquet dict decoding [arrow]
via GitHub
Re: [I] [Python] Better Error Message for ValueError [arrow]
via GitHub
Re: [I] [Integration] Add Rust map [arrow]
via GitHub
Re: [I] [C++] Add StopToken to ExecNode [arrow]
via GitHub
Re: [I] [C++][Compute] Allow Fragments to attach guarantees to scanned batches [arrow]
via GitHub
Re: [I] [C++] Distinguish between async-reentrant generators and unsafe generators [arrow]
via GitHub
Re: [I] [C++][Dataset] Enforce single use of Scanner [arrow]
via GitHub
Re: [I] [C++] [Dataset] Improve the _metadata example to show how to properly create _metadata if there is a partitioning [arrow]
via GitHub
[I] Use released apache/arrow instead of main for apache/arrow-java release [arrow-java]
via GitHub
Re: [I] Use released apache/arrow instead of main for apache/arrow-java release [arrow-java]
via GitHub
[I] Update vcpkg revision for Linux JNI build [arrow-java]
via GitHub
[I] [Python] test_memory.py fails with `-DARROW_MIMALLOC=OFF` [arrow]
via GitHub
[I] [Packaging][deb] Update `debian/watch` for version 5 [arrow]
via GitHub
[I] [C++] Add support for armv8 and armv9 [arrow]
via GitHub
Re: [I] [C++] Add support for armv8 and armv9 [arrow]
via GitHub
[I] [C++] Change the C Type of HalfFloatType [arrow]
via GitHub
[I] python: wheels fail to build on linux/amd64 [arrow-adbc]
via GitHub
Re: [I] python: wheels fail to build on linux/amd64 [arrow-adbc]
via GitHub
[I] docs: build is failing [arrow-adbc]
via GitHub
Re: [I] docs: build is failing [arrow-adbc]
via GitHub
[I] dev/release: nightly verification is failing [arrow-adbc]
via GitHub
Re: [I] dev/release: nightly verification is failing [arrow-adbc]
via GitHub
[I] Consider batching calls to GC.AddMemoryPressure [arrow-dotnet]
via GitHub
Re: [I] [CI] Remove extra ubuntu-r-only-r service from docker-compose.yml [arrow]
via GitHub
Re: [I] [C++][Gandiva] Implement parse_url hive function [arrow]
via GitHub
Re: [I] [C++] Kernel to convert timestamp with timezone to another timezone (metadata-only change) [arrow]
via GitHub
Re: [I] [C++] Support ORC in Arrow Dataset [arrow]
via GitHub
Re: [I] [Python][CI] macOS wheel builds should raise on linker warnings [arrow]
via GitHub
Re: [I] [R] Implement type determination more cleanly [arrow]
via GitHub
Re: [I] [C++] Optimize ArrayBuilder::AppendScalar [arrow]
via GitHub
[I] [C++][ORC] Add OrcFileFragment with stripe-level subsetting [arrow]
via GitHub
[I] [C++][R] Clean up macOS C++20 partial compatibility issues [arrow]
via GitHub
Re: [I] [C++][R] Clean up macOS C++20 partial compatibility issues [arrow]
via GitHub
[I] [Python] RecordBatch.serialize() should support writing into a pre-allocated buffer [arrow]
via GitHub
Re: [I] [C++] Reducing branching in compute/kernels/vector_selection.cc [arrow]
via GitHub
Re: [I] [R] Add bindings for ProjectOptions [arrow]
via GitHub
Re: [I] [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder [arrow]
via GitHub
Re: [I] [C++][Compute] Extract preallocation logic from KernelExecutor [arrow]
via GitHub
Re: [I] [R] Convert named lists to structs instead of (unnamed) lists [arrow]
via GitHub
Re: [I] [C++][Compute] Dispatch* should examine options as well as input types [arrow]
via GitHub
Re: [I] [R] Improve handling of R scalars in some nse_funcs [arrow]
via GitHub
[I] Feature Request: [postgresql] Comprehensive Metadata Discovery for Data Development Tools [arrow-adbc]
via GitHub
[I] [C++][FlightRPC] Add separate release & debug workflows for MacOS ODBC [arrow]
via GitHub
[I] [C++] Synthetic OOM tests are allocator-sensitive with mimalloc [arrow]
via GitHub
[I] [Python][Doc] Add examples for non-dense sparse tensor constructors [arrow]
via GitHub
[I] [Doc] Update docs to specify disclosure of AI on mailing list messages [arrow]
via GitHub
[I] [Doc][C++] Document security model for Arrow C++ [arrow]
via GitHub
[I] [Python] Wheel / sdist build uses docstrings generation script we don't want to distribute [arrow]
via GitHub
Re: [I] [R] Bindings for glue() [arrow]
via GitHub
Re: [I] [C++] Allow producers to require callbacks are scheduled on a new thread [arrow]
via GitHub
Re: [I] [R] Bindings for str_replace_na() [arrow]
via GitHub
Re: [I] [C++] String formatting kernel [arrow]
via GitHub
Re: [I] [C++][Python] Allow to specify a dictionary type for dictionary_encode() function [arrow]
via GitHub