Messages by Thread
-
[PR] chore: Update Release instructions [datafusion]
via GitHub
-
[PR] fix: array_concat widens container variant for mixed List/LargeList inputs [datafusion]
via GitHub
-
[I] concat_ws on array arguments silently stringifies the arrays [datafusion]
via GitHub
-
[I] array_concat fails with internal error on mixed List + LargeList inputs [datafusion]
via GitHub
-
[PR] Skip map_expressions rebuild for Extension nodes with empty expressions [datafusion]
via GitHub
-
[I] Skip map_expressions rebuild for Extension nodes with empty expressions [datafusion]
via GitHub
-
[PR] refactor: Share left-side spill file across partitions on OOM fallback [datafusion]
via GitHub
-
[I] Introduce version-specific behavior in Spark expressions [datafusion]
via GitHub
-
[I] Update documentation with Ballista TUI details [datafusion-ballista]
via GitHub
-
[I] partition_by and filter on ExprFunctionExt [datafusion]
via GitHub
-
[PR] feat: capture per-query output from CometSqlFileTestSuite via system property [datafusion-comet]
via GitHub
-
[PR] test: add SQL tests documenting Spark encode behavior [datafusion-comet]
via GitHub
-
Re: [PR] perf: Optimize approx count distinct using bitmaps instead of hashsets for smaller datatypes [datafusion]
via GitHub
-
Re: [I] Spark cannot reclaim memory from native operators (spill callback returns 0) [datafusion-comet]
via GitHub
-
[PR] feat: add review-datafusion-pr Claude Code skill [datafusion-comet]
via GitHub
-
[I] Add AGENTS.md [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add multi-column support for null-aware anti joins [datafusion]
via GitHub
-
Re: [I] feat: Extend single-value NDV optimization to timestamp and interval types [datafusion]
via GitHub
-
[I] ClickBench partitioned run.sh restarts process per try, losing hot-run caches [datafusion]
via GitHub
-
[PR] chore: Rename concat-specific string builders, make pub(crate) [datafusion]
via GitHub
-
[PR] feat(unparser): Keep inner join `Filter → TableScan` predicates to `WHERE` instead of moving to `JOIN ON` [datafusion]
via GitHub
-
[PR] Add Teradata dialect [datafusion-sqlparser-rs]
via GitHub
-
[PR] fix: remove unnecessary `as_any()` to fix compilation error [datafusion]
via GitHub
-
[PR] perf: parallelize CPU-heavy parquet metadata parsing in `list_files_for_scan` [datafusion]
via GitHub
-
Re: [I] feat: Extend single-value NDV optimization to string types [datafusion]
via GitHub
-
Re: [PR] Track `Parens<T>`'s span [datafusion-sqlparser-rs]
via GitHub
-
[PR] chore(deps): bump github/codeql-action from 4.35.1 to 4.35.2 [datafusion-comet]
via GitHub
-
Re: [PR] feat: support '>', '<', '>=', '<=', '<>' in all operator [datafusion]
via GitHub
-
[I] Initialize TopK from file / rowgroup / .. statistics [datafusion]
via GitHub
-
Re: [PR] feat: change approx percentile/median UDFs to return floats [datafusion]
via GitHub
-
[PR] Fix clippy 1.95 lint errors [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] Fix non-deterministic iteration in SessionStateBuilder [datafusion]
via GitHub
-
[PR] feat: Add native support for mode fn [datafusion-comet]
via GitHub
-
[I] Add native support for MODE aggregate function [datafusion-comet]
via GitHub
-
Re: [I] Optimize substr() to avoid copying for Utf8, LargeUtf8 [datafusion]
via GitHub
-
Re: [I] Arithmetic type coercion fails for RunEndEncoded columns [datafusion]
via GitHub
-
Re: [I] Optimize `left`, `right` to avoid copying for `Utf8`, `LargeUtf8` input [datafusion]
via GitHub
-
Re: [I] Implement `DFExtensionType` for Arrow's Canonical Extension Types [datafusion]
via GitHub
-
[I] Support volatile functions and scalar variables in ListingTable partition pruning [datafusion]
via GitHub
-
[PR] chore: Fix Clippy issues with Rust 1.95.0 [datafusion-ballista]
via GitHub
-
[PR] fix: rewrite concat(array, ...) to array_concat [datafusion]
via GitHub
-
[PR] perf: another ExternalSorter refactor [datafusion]
via GitHub
-
Re: [PR] fix: insert placeholder type inference showing wrong type when there is function wrapped placeholder (unknown type) [datafusion]
via GitHub
-
Re: [I] optimize_projections fails after mark-join involved [datafusion]
via GitHub
-
Re: [PR] fix: `optimize_projections` failure after mark joins created by `EXISTS OR EXISTS` [datafusion]
via GitHub
-
[PR] Add native support for max_by and min_by [datafusion-comet]
via GitHub
-
Re: [PR] feat(clickhouse): support PARTITION BY after ORDER BY and ARRAY JOIN [datafusion-sqlparser-rs]
via GitHub
-
Re: [I] SortMergeJoin with timestamp fix [datafusion-comet]
via GitHub
-
Re: [PR] feat: refactor percentiles to TypeSignature, coerce to floats [datafusion]
via GitHub
-
[PR] feat: add PostgreSQL EXCLUDE constraint parsing [datafusion-sqlparser-rs]
via GitHub
-
[PR] fix: import from `datafusion_expr` in `make_valid_utf8` [datafusion]
via GitHub
-
[PR] chore: Backport 53.1.0 changelog [datafusion]
via GitHub
-
[I] `datafusion-spark` fails to build standalone with `-p datafusion-spark` [datafusion]
via GitHub
-
Re: [I] [EPIC] [DISCUSS] Comet timezone handling [datafusion-comet]
via GitHub
-
Re: [I] Spark SQL test failures due to timestamp mismatch when LocalTableScan is native [datafusion-comet]
via GitHub
-
[PR] fix: remove spurious .flatten call that garbled SortMergeJoin fallback messages [datafusion-comet]
via GitHub
-
[I] Garbled SortMergeJoin fallback reason: `[COMET: e, s, n, j, y, T, t, u, U, a, m, i, …]` [datafusion-comet]
via GitHub
-
Re: [I] Garbled SortMergeJoin fallback reason: `[COMET: e, s, n, j, y, T, t, u, U, a, m, i, …]` [datafusion-comet]
via GitHub
-
Re: [I] Garbled SortMergeJoin fallback reason: `[COMET: e, s, n, j, y, T, t, u, U, a, m, i, …]` [datafusion-comet]
via GitHub
-
[I] Cleanup / overhaul `StringViewArrayBuilder` and related types [datafusion]
via GitHub
-
Re: [I] [Bug] BinaryView/StringView columns are spilled without GC and results in enormous spill files [datafusion]
via GitHub
-
[I] Add tests for spill file sizes [datafusion]
via GitHub
-
Re: [I] Use bitmap for count_distinct expression for u8/16 and i8/16 [perf] [datafusion]
via GitHub
-
Re: [I] update rat check to exclude stability files [datafusion-comet]
via GitHub
-
[PR] debug: assert columnar-to-row transitions have columnar children [datafusion-comet]
via GitHub
-
Re: [I] [DISCUSSION] Sorts being removed from subqueries [datafusion]
via GitHub
-
[PR] Prefetch morsels across files in FileStream (bounded at 20) [datafusion]
via GitHub
-
[PR] test: Add test for computing min/max values for expressions [datafusion]
via GitHub
-
[PR] chore: Point to the `opendal` revision where perf fixed [datafusion-comet]
via GitHub
-
Re: [I] Comet 0.15.0 Release [datafusion-comet]
via GitHub
-
[PR] fix: exclude tpcds-plan-stability extended.txt files from rat license check [datafusion-comet]
via GitHub
-
[PR] docs: clarify Maven staging behavior across release candidates [datafusion-comet]
via GitHub
-
[PR] feat: support `sort_array` [datafusion-comet]
via GitHub
-
[PR] docs: update Iceberg docs to reflect capabilities [datafusion-comet]
via GitHub
-
Re: [I] Deprecate `AggregateUDFImpl::is_nullable` in favour of `return_field` [datafusion]
via GitHub
-
[PR] deps: [DO-NOT-MERGE] test apache/datafusion/21680 [datafusion-comet]
via GitHub
-
[PR] fix: try again to fix Miri in ParquetOpener [datafusion]
via GitHub
-
[PR] perf(repartition): use SPSC channels + select_all in non-preserve-order mode [datafusion]
via GitHub
-
[PR] perf(repartition): batch reservation + sends via per-partition local buffers [datafusion]
via GitHub
-
[PR] Remove redundant Mutex from SharedMemoryReservation [datafusion]
via GitHub
-
Re: [PR] feat: enable native Iceberg reader by default [datafusion-comet]
via GitHub
-
[PR] Cherry pick Wire up with_new_state with DataSource #20718 [datafusion]
via GitHub
-
Re: [I] bug: native Iceberg reader can return wrong results for migrated Parquet files with INT96 timestamps [datafusion-comet]
via GitHub
-
Re: [I] bug: native Iceberg reader errors on residual filter on column after nested type for migrated Parquet files [datafusion-comet]
via GitHub
-
[PR] feat: add sort_pushdown_inexact benchmark for RG reorder [datafusion]
via GitHub