Messages by Thread
-
[I] [Python] Enable OpenTelemetry on PyArrow wheels [arrow]
via GitHub
-
Re: [I] [C++][Python] Failed to build pyarrow, missing Arrow C++ [arrow]
via GitHub
-
Re: [I] [C++][Python] Support binary_view in basic kernels [arrow]
via GitHub
-
[I] `pqarrow.SchemaField.IsLeaf()` unreliable because `ColIndex` is never set to -1 for non-leaves [arrow-go]
via GitHub
-
Re: [I] [Python] Pip install error for pyarrow 6.0.1 on Python 3.6.8 due to setuptools_scm transitive dependency [arrow]
via GitHub
-
Re: [I] Unable to load libhdfs [arrow]
via GitHub
-
Re: [I] [C++] CMake build of arrow libraries fails on Windows [arrow]
via GitHub
-
Re: [I] [C++] Vcpkg install error for abseil on windows when building Arrow C++ [arrow]
via GitHub
-
Re: [I] [Docs] Describe use of Jira Affects Version in Contributing docs [arrow]
via GitHub
-
Re: [I] [C++] Cannot install Arrow with Zstd on Windows [arrow]
via GitHub
-
Re: [I] [R] installation failure on R Studio Server [arrow]
via GitHub
-
Re: [I] [C++][Compute] Add Find method to Grouper [arrow]
via GitHub
-
Re: [I] [C++][Compute] Provide a default implementation of ExecNode::Pause/Resume [arrow]
via GitHub
-
Re: [I] [Python] Use IPC writing code for pickling RecordBatches [arrow]
via GitHub
-
Re: [I] [C++] Add an arrow::Table::GetFieldByName method [arrow]
via GitHub
-
Re: [I] [C++][Dataset] Remove UnionDataset in favor of UnionExecNode [arrow]
via GitHub
-
Re: [I] [Doc] Make main column width larger [arrow]
via GitHub
-
Re: [I] [C++] Improve performance of unpack64 [arrow]
via GitHub
-
[I] [R][CI] r-devdocs crossbow job fails during gap between C++ and R releases [arrow]
via GitHub
-
[I] [R] CRAN packaging checklist for version 23.0.1.1 [arrow]
via GitHub
-
[I] [Python][Parquet] Add options to control writing of Bloom filters to `parquet.write_table` [arrow]
via GitHub
-
Re: [I] [C++][Compute] Make a subset of compute:: available even if ARROW_COMPUTE=OFF [arrow]
via GitHub
-
Re: [I] [C++] Make index kernel work in exec plans [arrow]
via GitHub
-
Re: [I] [C++] [Dataset] Add optional scan type that tags batches with locational information [arrow]
via GitHub
-
Re: [I] [Gandiva] Support null data type for gandiva. [arrow]
via GitHub
-
Re: [I] [C++] Create utility for runtime warnings [arrow]
via GitHub
-
Re: [I] [R] Enable object name linter [arrow]
via GitHub
-
Re: [I] [R][CI] Clean up crossbow R templates [arrow]
via GitHub
-
Re: [I] format: support multiple result sets [arrow-adbc]
via GitHub
-
[I] CI: Python integration tests are being skipped in CI [arrow-dotnet]
via GitHub
-
[I] The IReadOnlyList indexer on top of BinaryArray doesn't return nulls [arrow-dotnet]
via GitHub
-
[I] [C++]: Work around `bit_width` not being available on MacOS's partially compatible C++20 build [arrow]
via GitHub
-
[I] [C++][R] More robust `libtool` checking [arrow]
via GitHub
-
[I] Basic compute/comparison kernels missing for string_view? [arrow]
via GitHub
-
Re: [I] [Java][Docs] Undocumented null return from CallHeaders.getAll() [arrow-java]
via GitHub
-
[I] [CI][C++] JNI build error: `'bit' file not found` [arrow]
via GitHub
-
Re: [I] [C++][Docs] Missing docs for many Datum members [arrow]
via GitHub
-
Re: [I] [C++] RecordBatch::Add/SetColumn w/ ArrayData [arrow]
via GitHub
-
Re: [I] Identify selected row when using filters [arrow]
via GitHub
-
Re: [I] Selective reading of rows for parquet file [arrow]
via GitHub
-
[I] [Ruby] Simplify reader tests [arrow]
via GitHub
-
Re: [I] [JAVA] Client is able to connect to GRPC_TLS flight server with GRPC_INSECURE [arrow-java]
via GitHub
-
[I] [Python][Dataset] Add filters parameter to orc.read_table() for predicate pushdown [arrow]
via GitHub
-
[I] [C++][Dataset] ORC predicate pushdown: full operator and type coverage [arrow]
via GitHub
-
[I] [C++][ORC] Add stripe statistics API to ORCFileReader [arrow]
via GitHub
-
[I] [C++][Dataset] Add OrcFileFragment with stripe filtering and predicate pushdown [arrow]
via GitHub
-
[I] [Python][Doc] Add import statement to `filters_to_expression` docstring example [arrow]
via GitHub
-
[I] IAM Auth Login for CloudSQL (GCP, Postgres, Python) [arrow-adbc]
via GitHub
-
[I] [C++] Remove deprecated APIs from v16-v18 releases [arrow]
via GitHub
-
Re: [I] I want to use arrow to recode some projects, but when I use arrow to read csv and compute some indicator, the speed of arrow c++ is even lower than python code, is there something wrong? [arrow]
via GitHub
-
Re: [I] [C++][Docs] Missing docs for ArrayData [arrow]
via GitHub
-
Re: [I] [CI] Add archery subcommand for comparing diffs of 2 CI runs [arrow]
via GitHub
-
Re: [I] [R] [Docs] Document py_to_r and r_to_py [arrow]
via GitHub
-
Re: [I] [C++][Docs] Scalars User Guide [arrow]
via GitHub
-
Re: [I] [CI] autotune cmake is broken [arrow]
via GitHub
-
Re: [I] [C++][Compute] Consider removing ScalarAggregateKernel [arrow]
via GitHub
-
Re: [I] [C++] Review/apply guidelines for comment tags [arrow]
via GitHub
-
Re: [I] [Documentation] Documentation Improvements [arrow]
via GitHub
-
Re: [I] [C++] Add option to coalesce kernel to treat NaN as null [arrow]
via GitHub
-
[I] Bindings with C library [arrow-nanoarrow]
via GitHub
-
[I] Documentation is profoundly unhelpful to Rubyists new to Arrow [arrow]
via GitHub
-
[I] [C++] Vendored date library does not respect TZDIR environment variable [arrow]
via GitHub
-
[I] Add custom_metadata support for RecordBatch IPC messages [arrow-swift]
via GitHub
-
[I] [Doc][Python] Simplify doctests in tables.pxi and types.pxi [arrow]
via GitHub
-
Re: [I] [C++][CMake] 16.0.0: build fails because missing boost detection [arrow]
via GitHub
-
[I] [CI][C++] paginator.h missing in S3PaginationBase.h [arrow]
via GitHub
-
[I] [Packaging] Add support for Ubuntu 26.04 [arrow]
via GitHub
-
[I] [R] Preserve row order in `write_dataset()` [arrow]
via GitHub
-
[I] flightsql/driver: tx.QueryContext with no args appears to ignore active transaction handle [arrow-go]
via GitHub
-
[I] Consider shipping a managed allocator [arrow-dotnet]
via GitHub
-
Re: [I] [C++] [Dataset] The CSV file format currently always disables multithreading [arrow]
via GitHub
-
Re: [I] [Python] Column with over 2GB size limit but still identified as String in schema [arrow]
via GitHub
-
Re: [I] [C++] Fully deprecate CompareOptions [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Parquet] Use arrow compute to determine min/max of dictionaries (possibly other arrays?) [arrow]
via GitHub
-
Re: [I] [Python] Better pytest parametrization for different compression codecs [arrow]
via GitHub
-
Re: [I] [R] More special handling for known errors in arrow_eval [arrow]
via GitHub
-
Re: [I] [C++][python] performance of read_table using filters on a partitioned parquet file [arrow]
via GitHub
-
Re: [I] [C++] Add option to is_nan kernel to return true on null [arrow]
via GitHub
-
Re: [I] [R] Some errors in tests on Darwin PPC due to locale and datetime: [ FAIL 11 | WARN 16 | SKIP 111 | PASS 6586 ] [arrow]
via GitHub
-
Re: [I] [R] arrow segfaults on macOS 13 on loading: address 0xdde0, cause 'memory not mapped' [arrow]
via GitHub
-
Re: [I] [R]: Build system sneaks in rpath which breaks loading: arrow.so: Library not loaded: @rpath/libarrow.1100.dylib [arrow]
via GitHub
-
Re: [I] [R] Seed is honored when using DBI but not after arrow::to_duckdb [arrow]
via GitHub
-
Re: [I] [R] `configure` does not fail when `nixlibs.R` exits with status 1 [arrow]
via GitHub
-
Re: [I] [R] Timezone handling in round-trip of POSIXct [arrow]
via GitHub
-
Re: [I] [R] Invalid: Float value was truncated converting to int32 [arrow]
via GitHub
-
Re: [I] [R] Missing exports needed to create socket-based RecordBatchStreamReader [arrow]
via GitHub
-
Re: [I] [R] Failed to parse string: '' as a scalar of type double [arrow]
via GitHub
-
Re: [I] [R] Unable to load the package arrow getting error [arrow]
via GitHub
-
[I] [C++] How to add support for ordering in `arrow::ArrayStatistics`? [arrow]
via GitHub
-
[I] [CI] C++ extra jobs are executed with the `CI: Extra: R` label [arrow]
via GitHub
-
Re: [I] [CI] Bump timeout on Integration pipeline [arrow]
via GitHub
-
Re: [I] [Integration] Time the integration tests and report durations [arrow]
via GitHub
-
Re: [I] [Python] Add support for "is" and "is not" to `pyarrow.parquet.filters_to_expression` [arrow]
via GitHub
-
Re: [I] [R] Refactor r/configure [arrow]
via GitHub
-
[I] [R] Update docs to reflect removal of OpenSSL 1.0 and 1.1 support [arrow]
via GitHub
-
Re: [I] [R] Developer setup guides need more context on SSL versions [arrow]
via GitHub
-
[I] [Python] Conversion to/from numpy 2.0+ new StringDType [arrow]
via GitHub
-
[I] [C++][FS][Azure] Expose parallel transfer config options available in the Azure SDK [arrow]
via GitHub
-
Re: [I] [C++] Add a type_singleton utility function [arrow]
via GitHub
-
[I] [C++][Parquet][CI] Add fuzzer for encoder/decoder roundtrip [arrow]
via GitHub
-
Re: [I] [CI][Python] Disable Dataset in "minimal" builds [arrow]
via GitHub
-
Re: [I] [R][CI] Bump the R versions we test to include 4.3 [arrow]
via GitHub
-
Re: [I] [R][Docs] Add docs on what dplyr + tidyverse functionality we support [arrow]
via GitHub
-
Re: [I] [R] Refactor repeated code into check_match function [arrow]
via GitHub
-
Re: [I] [Benchmarking][R] conbench is failing [arrow]
via GitHub
-
Re: [I] [Gandiva] Add support for literal variables [arrow]
via GitHub
-
Re: [I] [Python] Implement conversion between integer coded as floating points with NaN to an Arrow integer type [arrow]
via GitHub
-
Re: [I] [C++] Native result set adapter for PostgreSQL / libpq [arrow]
via GitHub
-
Re: [I] [C++] Disable ASAN when building io-hdfs-test.cc [arrow]
via GitHub
-
Re: [I] [Python] Appending to streamable table file format doesn't seem to work [arrow]
via GitHub
-
Re: [I] [C++] Native client interface to SQL Server / TDS protocol [arrow]
via GitHub
-
Re: [I] [C++][ORC] Enable copy free conversion for primitive types [arrow]
via GitHub
-
Re: [I] [C++] Native client interface to Clickhouse [arrow]
via GitHub
-
Re: [I] [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely [arrow]
via GitHub
-
Re: [I] [C++] Native database client for MariaDB / MySQL client protocol [arrow]
via GitHub
-
Re: [I] [Python][C++] MemoryPool is destructed before deallocating its buffers leads to segfault [arrow]
via GitHub
-
Re: [I] [C++] Enable copy free conversion for dictionary encoded string column in ORC adapter [arrow]
via GitHub
-
Re: [I] [Python] Update the documentation about Schema & Metadata usage [arrow]
via GitHub
-
Re: [I] [Python] Reading Parquet file crashes on windows - python3.8 [arrow]
via GitHub
-
Re: [I] [R] If pkg-config finds arrow on default search path, we don't know if it was built with ARROW_S3 [arrow]
via GitHub
-
Re: [I] [C++] Support LTO for R [arrow]
via GitHub
-
Re: [I] [Python] pyarrow deserialize return datetime.datetime [arrow]
via GitHub
-
Re: [I] [Python] Lose access to indices & dictionary roundtripping DictionaryArray to parquet file [arrow]
via GitHub
-
Re: [I] [Python] Manual dataset with timestamp partition type error [arrow]
via GitHub
-
Re: [I] [C++][Python] Python compute kernel tests assume C++ is built with utf8proc [arrow]
via GitHub
-
Re: [I] [C++][Python] Behavior of parquet.read_table with filter and parquets containing null [arrow]
via GitHub
-
Re: [I] [Python] pyarrow2.0.0 flight test crash on macOS [arrow]
via GitHub
-
Re: [I] [C++] Micro-optimize integer parsing [arrow]
via GitHub
-
Re: [I] [Python] Getting reference not found with ORC enabled pyarrow [arrow]
via GitHub
-
Re: [I] [Python][Packaging] Fix Homebrew Install Python 3 NumPy not found failure [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Timestamp ColumnDescriptor (from logical type) incorrectly showing ConvertedType as NONE [arrow]
via GitHub
-
Re: [I] [Integration] Enable Arrow to read Parquet files from Spark 2.x with illegal nulls [arrow]
via GitHub
-
Re: [I] [Python] Parquet reader cannot read large strings [arrow]
via GitHub
-
Re: [I] [C++][Compute] Overhaul CanCast() helper function [arrow]
via GitHub
-
Re: [I] Out-of-heap memory leaks in FlightClient.getStream [arrow]
via GitHub
-
Re: [I] [C++] Compilation failure in arrow/scalar.cc on Xcode 8.3.3 [arrow]
via GitHub
-
Re: [I] [Website] Write blog post about C++ endianness compatibility [arrow]
via GitHub
-
Re: [I] [C++] Dict index type ALWAYS gets coerced to int32 when saving to parquet [arrow]
via GitHub
-
Re: [I] PyArrow unable to read file with large string values [arrow]
via GitHub
-
Re: [I] [Python] Initial table.take(...) call takes much longer [arrow]
via GitHub
-
Re: [I] [C++] CSV streaming reader doesn't handle cancellation correctly [arrow]
via GitHub
-
Re: [I] [C++] arrow-threading-utility-test takes a long time [arrow]
via GitHub
-
Re: [I] [R] Build fails if dataset enabled but parquet is not [arrow]
via GitHub
-
Re: [I] [C++][Parquet] StatisticsAsScalars doesn't support Decimal conversion for int primitives [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Root message of parquet may contain repetition [arrow]
via GitHub
-
Re: [I] [C++] [Parquet] Primitive types have defined num_children [arrow]
via GitHub
-
Re: [I] [C++] [Python] Python tests fail if compiled with glog [arrow]
via GitHub
-
Re: [I] [C++] C++ IPC reading looks like it doesn't support uncompressed buffer convention for compressed buffers [arrow]
via GitHub
-
Re: [I] [R] Writing to Parquet from tibble Consumes Large Amount of Memory [arrow]
via GitHub
-
Re: [I] [Doc] Update crossbow docs for archery [arrow]
via GitHub
-
Re: [I] OSError: Invalid IPC stream: negative continuation token [arrow]
via GitHub
-
Re: [I] [Python][C++] S3FileSystem with proxy_options is very slow on Windows [arrow]
via GitHub
-
Re: [I] [Python] TypeError when accessing length of an invalid ListScalar [arrow]
via GitHub
-
Re: [I] [Python][C++] pa.total_allocated_bytes incorrect after switching the default allocator [arrow]
via GitHub
-
Re: [I] [Python] StructScalar Timestamp using .to_pandas() loses/converts type [arrow]
via GitHub
-
Re: [I] RecordBatchBuilder with uint dictionary creates signed int Batch [arrow]
via GitHub
-
Re: [I] [Python] bool value of scalars depends on data type [arrow]
via GitHub
-
Re: [I] [Documentation] SEO tags confused for some pages [arrow]
via GitHub
-
Re: [I] [C++][Gandiva] Performance issue for TreeExprBuilder::MakeIf when nested plenty times. [arrow]
via GitHub
-
Re: [I] [C++] Warning when compiling on ubunut 21.04 [arrow]
via GitHub
-
Re: [I] [C++] StructArray ToString method doesn't print field names [arrow]
via GitHub
-
Re: [I] [Python] HadoopFileSystem crash when called twice and Java was misconfigured [arrow]
via GitHub
-
Re: [I] [Python] Add DataType.to_numpy_dtype (equivalent of to_pandas_dtype, but for numpy) [arrow]
via GitHub
-
Re: [I] [C++] ArrowLog with FATAL level is not robust if running in the service [arrow]
via GitHub
-
Re: [I] [C++] Thread pool leaks memory when forking (and could maybe deadlock) if threads exist at the time of fork [arrow]
via GitHub
-
Re: [I] [C++][Parquet] StreamReader.SkipColumns slow [arrow]
via GitHub
-
Re: [I] [Python] Non-nullable schema fields not checked in Table.from_pydict [arrow]
via GitHub
-
Re: [I] [CI] [C++] TestToDateHolder test error [arrow]
via GitHub
-
Re: [I] [Dev] r_valgrind image doesn't use full parallelism [arrow]
via GitHub
-
Re: [I] Shared libraries linker error when using clang, C++ 20, and ld [arrow]
via GitHub
-
Re: [I] [C++] [Python] Dictionary equality not correct? [arrow]
via GitHub
-
Re: [I] [C++] Bump AWS SDK versions in ThirdpartyToolchain to build on GCC11 [arrow]
via GitHub
-
Re: [I] [C++] Tests maybe uninitialized compiler warnings [arrow]
via GitHub
-
Re: [I] [Docs] Incorrect contact email in Github [arrow]
via GitHub
-
Re: [I] [Python] Inconsistent handling of integer-valued partitions in dataset filters API [arrow]
via GitHub
-
Re: [I] [Python] Breaking API change in FSSpecHandler, requires metadata argument [arrow]
via GitHub
-
Re: [I] [C++] Add async version of the ORC Dataset scanner [arrow]
via GitHub
-
Re: [I] [C++] ThreadIndexer occasionally fails in CI with "Check failed: (thread_index) < (Capacity())" [arrow]
via GitHub
-
Re: [I] [C++][Python] Generated argument description for compute meta-functions not accurate [arrow]
via GitHub
-
Re: [I] [C++][Compute] Implicit cast should verify decimal precision [arrow]
via GitHub
-
Re: [I] [R] Bindings for stringr::str_extract/str_extract_all ~ "extract_regex" kernel [arrow]
via GitHub
-
Re: [I] [C++] S3FileSystem enable automatic temporary credential refreshing for AWS Instance Profile [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Reading dict pages is not reading all values? [arrow]
via GitHub
-
Re: [I] [C++][Parquet] Incremental decoding not tested [arrow]
via GitHub
-
Re: [I] [C++] Improve array size estimation to account for shared buffers [arrow]
via GitHub
-
Re: [I] [C++] TSAN error in ExecPlanExecution.SelfInnerHashJoinSink [arrow]
via GitHub