github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/06/04
Re: [PR] feat: support size() for MapType input [datafusion-comet]
via GitHub
2026/06/04
Re: [I] Add csharp support [datafusion]
via GitHub
2026/06/04
Re: [I] Unambiguous Column Reference Error Not Triggered in SQL Query [datafusion]
via GitHub
2026/06/04
Re: [I] SMALLINT addition overflow should error instead of silently wrapping [datafusion]
via GitHub
2026/06/04
Re: [I] Support Expanding OR Conditions in INNER JOIN into Multiple Mutually Exclusive Branches [datafusion]
via GitHub
2026/06/04
Re: [PR] test: benchmarks and SLT tests for push-down TopK through join [datafusion]
via GitHub
2026/06/04
Re: [I] Concurrent cache misses on the same path trigger duplicate metadata loads in FileMetadataCache [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: remove summary frontmatter from Comet EKS blog post [datafusion-site]
via GitHub
2026/06/04
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/06/04
Re: [PR] test: benchmarks and SLT tests for push-down TopK through join [datafusion]
via GitHub
2026/06/04
[PR] fix: remove summary frontmatter from Comet EKS blog post [datafusion-site]
via GitHub
2026/06/04
Re: [PR] feat: Add Native Support for In-Memory Cache [datafusion-comet]
via GitHub
2026/06/04
Re: [I] Explore options for accelerating InMemoryTableScanExec [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] Add optimize_with_context to FFI_PhysicalOptimizerRule [datafusion]
via GitHub
2026/06/04
Re: [PR] Add optimize_with_context to FFI_PhysicalOptimizerRule [datafusion]
via GitHub
2026/06/04
Re: [PR] Add optimize_with_context to FFI_PhysicalOptimizerRule [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: Add Native Support for In-Memory Cache [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] Add optimize_with_context to FFI_PhysicalOptimizerRule [datafusion]
via GitHub
2026/06/04
[PR] Add native support for in-memory cache [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] docs: revise OptimizerRule trait method descriptions [datafusion]
via GitHub
2026/06/04
Re: [PR] docs: revise OptimizerRule trait method descriptions [datafusion]
via GitHub
2026/06/04
Re: [PR] docs: revise OptimizerRule trait method descriptions [datafusion]
via GitHub
2026/06/04
Re: [PR] Add map_transform UDF [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support size() for MapType input [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] feat: support size() for MapType input [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [I] Join per-column stats not scaled with estimated row count [datafusion]
via GitHub
2026/06/04
[I] Join per-column stats not scaled with estimated row count [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: Skip `fork` and `forkserver` on `win32` [datafusion-python]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] test: add timestamp ntz array cast coverage [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] bench: add predicate_eval SQL micro-benchmark suite for conjunctive filter evaluation [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] perf: Reorder predicates in conjuncts via simple heuristic [datafusion]
via GitHub
2026/06/04
Re: [I] Reorder boolean expressions (including filter predicates) according to evaluation cost / selectivity [datafusion]
via GitHub
2026/06/04
Re: [PR] Add `array_product` UDF [datafusion]
via GitHub
2026/06/04
Re: [PR] docs: revise OptimizerRule trait method descriptions [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] Add declared file scan output partitioning [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: add max_row_group_bytes option to ParquetOptions [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
[I] Comet native Iceberg scan duplicates rows when splitting a single-row-group Parquet file into multiple byte-range tasks [datafusion-comet]
via GitHub
2026/06/04
Re: [I] NestedLoopJoinExec spill path: untracked allocation overshoots memory pool [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [I] NestedLoopJoinExec spill path: untracked allocation overshoots memory pool [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: preserve timestamp precision when coercing mixed time units [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: UNIQUE constraint with NULLs incorrectly collapses GROUP BY groups [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [PR] fix generate_series table function overflows [datafusion]
via GitHub
2026/06/04
[PR] test: benchmarks and SLT tests for push-down TopK through join [datafusion]
via GitHub
2026/06/04
Re: [PR] fix factorial(21) should not overflow when PostgreSQL returns a numeric answer [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix log(0.0::float8) should error, not return -inf [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: SMALLINT addition overflow should error instead of silently wrapping [datafusion]
via GitHub
2026/06/04
Re: [I] Incorrect timestamp comparison with mixed time units [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: raise AmbiguousReference error for duplicate column names in subquery [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: UNIQUE constraint with NULLs incorrectly collapses GROUP BY groups [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: Support Expanding OR Conditions in INNER JOIN into Multiple Mutually Exclusive Branches [datafusion]
via GitHub
2026/06/04
[PR] fix: timestamp comparisons to coerce to finer unit [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix DataFusion drops grouped MIN/MAX rows with NULL sort keys under ORDER BY + LIMIT [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix volatile scalar subquery deduplication [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix volatile scalar subquery deduplication [datafusion]
via GitHub
2026/06/04
Re: [PR] chore(docker): Use yellow_tripdata_2022-01.parquet from nightlies.apache.org [datafusion-ballista]
via GitHub
2026/06/04
Re: [I] panic: date_bin overflows scaling extreme Timestamp(Second) source [datafusion]
via GitHub
2026/06/04
Re: [PR] fix date_bin overflows scaling extreme Timestamp(Second) source [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: add DataFrame fill_nan [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [I] Incorrect timestamp comparison with mixed time units [datafusion]
via GitHub
2026/06/04
Re: [PR] Optimize arrays_zip for perfectly aligned arrays [datafusion]
via GitHub
2026/06/04
Re: [PR] Optimize arrays_zip for perfectly aligned arrays [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support Spark expression json_array_length [datafusion-comet]
via GitHub
2026/06/04
Re: [I] Extraneous casts added due to overly strict type comparison [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support Boolean in approx_distinct [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: avoid extraneous casts for equivalent nested types [datafusion]
via GitHub
2026/06/04
Re: [I] PostgreSQL compatibility: `replace` with an empty search string should be a no-op [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: make replace empty search string a no-op [datafusion]
via GitHub
2026/06/04
Re: [PR] perf: optimize arrays_zip perfect list zips [datafusion]
via GitHub
2026/06/04
Re: [I] `power(decimal, -integer)` is incorrect [datafusion]
via GitHub
2026/06/04
Re: [I] `power(decimal, integer)` overflow for moderate exponents [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: Remove `power(decimal, int)` code path [datafusion]
via GitHub
2026/06/04
Re: [I] Optimize `arrays_zip` to avoid row-by-row copying in the perfect-zip case [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: replace with empty search string should be a no-op [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: Skip `fork` and `forkserver` on `win32` [datafusion-python]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [I] NestedLoopJoinExec spill path: untracked allocation overshoots memory pool [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: Skip `fork` and `forkserver` on `win32` [datafusion-python]
via GitHub
2026/06/04
Re: [PR] feat: import user-defined physical optimizer rules over FFI [datafusion-python]
via GitHub
2026/06/04
Re: [I] Expose physical plan optimizer [datafusion-python]
via GitHub
2026/06/04
Re: [PR] feat: import user-defined physical optimizer rules over FFI [datafusion-python]
via GitHub
2026/06/04
Re: [PR] perf: Reorder predicates in conjuncts via simple heuristic [datafusion]
via GitHub
2026/06/04
Re: [I] Concurrent cache misses on the same path trigger duplicate metadata loads in FileMetadataCache [datafusion]
via GitHub
2026/06/04
Re: [PR] feat(aggregate): cost-aware partial-aggregation skip (opt-in) [datafusion]
via GitHub
2026/06/04
Re: [I] Hash aggregation produces batches reporting huge memory size [datafusion]
via GitHub
2026/06/04
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/06/04
Re: [PR] perf(optimizer): EliminateCrossJoin fast-path for join-free plans [datafusion]
via GitHub
2026/06/04
Re: [I] Concurrent cache misses on the same path trigger duplicate metadata loads in FileMetadataCache [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: add max_row_group_bytes option to ParquetOptions [datafusion]
via GitHub
2026/06/04
Re: [PR] chore(docker): Use yellow_tripdata_2022-01.parquet from nightlies.apache.org [datafusion-ballista]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Split hash aggregation logic into separated streams [datafusion]
via GitHub
2026/06/04
[PR] fix: Skip `fork` and `forkserver` on `win32` [datafusion-python]
via GitHub
2026/06/04
Re: [PR] perf(logical-plan): box CreateExternalTable / CreateFunction in DdlStatement (-45% LogicalPlan size) [datafusion]
via GitHub
2026/06/04
Re: [I] Hash aggregation produces batches reporting huge memory size [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: implement retract_batch for array_agg(DISTINCT) sliding window [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [I] Downstream consumers of AggregateExec significantly overcount memory usage [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [I] Downstream consumers of AggregateExec significantly overcount memory usage [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: add max_row_group_bytes option to ParquetOptions [datafusion]
via GitHub
2026/06/04
Re: [I] Downstream consumers of AggregateExec significantly overcount memory usage [datafusion]
via GitHub
2026/06/04
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/04
Re: [I] Incorrect memory accounting in `array_agg` function [datafusion]
via GitHub
2026/06/04
Re: [I] Incorrect memory accounting in `array_agg` function [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [I] Fix memory accounting in Datafusion [datafusion]
via GitHub
2026/06/04
Re: [I] Hash aggregation produces batches reporting huge memory size [datafusion]
via GitHub
2026/06/04
Re: [I] Incorrect memory accounting in `array_agg` function [datafusion]
via GitHub
2026/06/04
Re: [I] [DISCUSSION] Memory accounting model discussion [datafusion]
via GitHub
2026/06/04
Re: [I] Downstream consumers of AggregateExec significantly overcount memory usage [datafusion]
via GitHub
2026/06/04
Re: [I] More accurate memory accounting in external sort [datafusion]
via GitHub
2026/06/04
Re: [I] GroupValuesRows::emit untracked decode buffer leaks past MemoryReservation [datafusion]
via GitHub
2026/06/04
Re: [I] NestedLoopJoinExec spill path: untracked allocation overshoots memory pool [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [PR] feat(aqe): empty join handling [datafusion-ballista]
via GitHub
2026/06/04
[I] [EPIC] A collection of Memory Accounting Limitations and Improvements [datafusion]
via GitHub
2026/06/04
Re: [I] NestedLoopJoinExec spill path: untracked allocation overshoots memory pool [datafusion]
via GitHub
2026/06/04
Re: [PR] Add declared file scan output partitioning [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [I] Split Aggregation Logic into Dedicated Streams [datafusion]
via GitHub
2026/06/04
Re: [I] Split Aggregation Logic into Dedicated Streams [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix correlated EXISTS with constant-false predicate produces dangling mark column reference [datafusion]
via GitHub
2026/06/04
Re: [PR] perf(logical-plan): box CreateExternalTable / CreateFunction in DdlStatement (-45% LogicalPlan size) [datafusion]
via GitHub
2026/06/04
[PR] test: add timestamp ntz array cast coverage [datafusion-comet]
via GitHub
2026/06/04
Re: [PR] perf: Reorder predicates in conjuncts via simple heuristic [datafusion]
via GitHub
2026/06/04
Re: [PR] perf: Reorder predicates in conjuncts via simple heuristic [datafusion]
via GitHub
2026/06/04
Re: [PR] fix date_bin overflows scaling extreme Timestamp(Second) source [datafusion]
via GitHub
2026/06/04
Re: [I] GroupValuesRows::emit untracked decode buffer leaks past MemoryReservation [datafusion]
via GitHub
2026/06/04
Re: [I] GroupValuesRows::emit untracked decode buffer leaks past MemoryReservation [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: avoid extraneous casts for equivalent nested types [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support Boolean in approx_distinct [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: Remove `power(decimal, int)` code path [datafusion]
via GitHub
2026/06/04
Re: [PR] perf: optimize arrays_zip perfect list zips [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: replace with empty search string should be a no-op [datafusion]
via GitHub
2026/06/04
[I] Downstream consumers of AggregateExec significantly overcount memory usage [datafusion]
via GitHub
2026/06/04
Re: [PR] Add `array_product` UDF [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix log(0.0::float8) should error, not return -inf [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix log(0.0::float8) should error, not return -inf [datafusion]
via GitHub
2026/06/04
Re: [PR] Add optimize_with_context to FFI_PhysicalOptimizerRule [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix log(0.0::float8) should error, not return -inf [datafusion]
via GitHub
2026/06/04
Re: [PR] Fix log(0.0::float8) should error, not return -inf [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: add DataFrame fill_nan [datafusion]
via GitHub
2026/06/04
[I] Incorrect timestamp comparison with mixed time units [datafusion]
via GitHub
2026/06/04
Re: [PR] fix: regex simplification of anchored patterns produces wrong results [datafusion]
via GitHub
2026/06/04
Re: [PR] Add declared file scan output partitioning [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
[PR] fix: handle NULLs in sliding SUM(DISTINCT) window frames [datafusion]
via GitHub
2026/06/04
Re: [I] Semi/anti join column stats not scaled with estimated row count [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support size() for MapType input (#4472) [datafusion-comet]
via GitHub
2026/06/04
[I] SUM(DISTINCT) over a bounded window frame ignores NULLs' validity mask and returns 0 instead of NULL [datafusion]
via GitHub
2026/06/04
Re: [PR] test: initial repro test by codex [datafusion]
via GitHub
2026/06/04
Re: [PR] test: initial repro test by codex [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Update SortMergeJoin to use async spill abstractions [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Update SortMergeJoin to use async spill abstractions [datafusion]
via GitHub
2026/06/04
[PR] test: initial repro test by codex [datafusion]
via GitHub
2026/06/04
Re: [PR] refactor: Update SortMergeJoin to use async spill abstractions [datafusion]
via GitHub
2026/06/04
Re: [PR] feat: support size() for MapType input (#4472) [datafusion-comet]
via GitHub
2026/06/03
Re: [PR] refactor: Update SortMergeJoin to use async spill abstractions [datafusion]
via GitHub
2026/06/03
Re: [PR] refactor(physical-plan): extract make_group_column factory + tighten Time variants + dhat harness [datafusion]
via GitHub
2026/06/03
Re: [PR] feat: implement retract_batch for array_agg(DISTINCT) sliding window [datafusion]
via GitHub
2026/06/03
Re: [PR] Fix volatile scalar subquery deduplication [datafusion]
via GitHub
2026/06/03
Re: [PR] fix generate_series table function overflows [datafusion]
via GitHub
2026/06/03
[PR] chore(docker): Use yellow_tripdata_2022-01.parquet from nightlies.apache.org [datafusion-ballista]
via GitHub
2026/06/03
[PR] Chore skip agg [datafusion]
via GitHub
2026/06/03
Re: [PR] Fix volatile scalar subquery deduplication [datafusion]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump astral-sh/setup-uv from 8.1.0 to 8.2.0 [datafusion]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump aws-config from 1.8.17 to 1.8.18 [datafusion-ballista]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump actions/checkout from 6.0.2 to 6.0.3 [datafusion]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump taiki-e/install-action from 2.79.8 to 2.81.3 [datafusion]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump github/codeql-action from 4.36.0 to 4.36.1 [datafusion]
via GitHub
2026/06/03
Re: [PR] chore(deps): bump taiki-e/install-action from 2.79.9 to 2.81.3 [datafusion-ballista]
via GitHub
2026/06/03
Re: [PR] feat: implement Spark-compatible weekday function [datafusion]
via GitHub
Earlier messages
Later messages