github
Messages by Date
2026/04/21
[PR] feat(parquet): row-group morselization for sibling FileStream stealing [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] docs: Add documentation on Spark 4 limitations [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Snowflake: Add support for text data type modifiers [datafusion-sqlparser-rs]
via GitHub
2026/04/21
Re: [PR] Comet 0.15.0 blog post [datafusion-site]
via GitHub
2026/04/21
Re: [PR] Comet 0.15.0 blog post [datafusion-site]
via GitHub
2026/04/21
[PR] chore: skip Iceberg and Spark SQL test workflows on test-only changes [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
[I] perf: Iceberg DPP executes dim table broadcast twice instead of reusing join's broadcast exchange [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Add quote style and trimming to csv writier [datafusion]
via GitHub
2026/04/21
Re: [PR] chore: backport version from `branch-53`, update some dependencies [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
2026/04/21
Re: [PR] chore: update compatibility guide for primitive to string casts [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] chore: update compatibility guide for primitive to string casts [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
[PR] test: add sql-file test confirming fallback on parquet variant reads [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/04/21
Re: [PR] Rich t kid/dictionary encoding hash optmize [datafusion]
via GitHub
2026/04/21
Re: [PR] Rich t kid/dictionary encoding hash optmize [datafusion]
via GitHub
2026/04/21
Re: [PR] Nested structure support [datafusion]
via GitHub
2026/04/21
[PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/04/21
Re: [PR] Add SQL based benchmarking [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
2026/04/21
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/21
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/21
[I] Remove `native_iceberg_compat` scan [datafusion-comet]
via GitHub
2026/04/21
[PR] Add support for nested types to nullif. [datafusion]
via GitHub
2026/04/21
[I] Add support for nested types to `nullif`. [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
[PR] chore: Remove config option for `native_iceberg_compat` [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [I] Exchange reuse broken when CometExecRule converts BroadcastExchangeExec after ReuseExchangeAndSubquery [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Support Dictionary Arrays in MIN/MAX Aggregates [datafusion]
via GitHub
2026/04/21
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/21
Re: [PR] chore(deps): bump taiki-e/install-action from 2.75.10 to 2.75.18 [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] docs: Add documentation on Spark 4 limitations [datafusion-comet]
via GitHub
2026/04/21
[PR] docs: Add documentation on Spark 4 limitations [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 in the all-other-cargo-deps group [datafusion]
via GitHub
2026/04/21
Re: [PR] chore(deps): bump github/codeql-action from 4.35.1 to 4.35.2 [datafusion]
via GitHub
2026/04/21
Re: [PR] fix: Fix local `datafusion-cli` test failure [datafusion]
via GitHub
2026/04/21
Re: [PR] chore(deps): bump astral-sh/setup-uv from 8.0.0 to 8.1.0 [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [I] [EPIC] Add support for Spark 4.0 [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] DataFrame API: allow aggregate functions in select() (#17874) [datafusion]
via GitHub
2026/04/21
Re: [PR] DataFrame API: allow aggregate functions in select() (#17874) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
2026/04/21
Re: [PR] Handle canceled partitioned hash join dynamic filters lazily [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/21
Re: [PR] Add ExpressionAnalyzer for pluggable expression-level statistics estimation [datafusion]
via GitHub
2026/04/21
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/21
[PR] Improve ergonomics for ExecutionPlanMetricsSet and MetricsSet [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] chore: Start 0.16 development [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] fix(aggregate): show aliased expr in explain [datafusion]
via GitHub
2026/04/21
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/21
Re: [PR] Snowflake: Add support for text data type modifiers [datafusion-sqlparser-rs]
via GitHub
2026/04/21
Re: [PR] PostgreSQL `UNLOGGED` Table Support and `ALTER TABLE ... SET LOGGED|UNLOGGED` [datafusion-sqlparser-rs]
via GitHub
2026/04/21
Re: [PR] fix: Validate spill read schema [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] refactor: Simplify NLJ re-scans with `ReplayableStreamSource` [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
[PR] fix: Fix local `datafusion-cli` test failure [datafusion]
via GitHub
2026/04/21
[PR] fix: substring with negative start index [datafusion-comet]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/04/21
Re: [PR] refactor: Simplify NLJ re-scans with `ReplayableStreamSource` [datafusion]
via GitHub
2026/04/21
[PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 in the all-other-cargo-deps group [datafusion]
via GitHub
2026/04/21
[PR] chore(deps): bump astral-sh/setup-uv from 8.0.0 to 8.1.0 [datafusion]
via GitHub
2026/04/21
Re: [PR] [Minor]: unify ANY/ALL planning and align ANY NULL semantics with PG [datafusion]
via GitHub
2026/04/21
Re: [I] How to limit the maximum memory usage of the ballista executor? [datafusion-ballista]
via GitHub
2026/04/21
[PR] chore(deps): bump github/codeql-action from 4.35.1 to 4.35.2 [datafusion]
via GitHub
2026/04/21
[PR] chore(deps): bump taiki-e/install-action from 2.75.10 to 2.75.18 [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] Support Dictionary Arrays in MIN/MAX Aggregates [datafusion]
via GitHub
2026/04/21
Re: [PR] refactor: Simplify NLJ re-scans with `ReplayableStreamSource` [datafusion]
via GitHub
2026/04/21
Re: [PR] refactor: Simplify NLJ re-scans with `ReplayableStreamSource` [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] Support Dictionary Arrays in MIN/MAX Aggregates [datafusion]
via GitHub
2026/04/21
Re: [PR] fix: Validate spill read schema [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] fix(aggregate): show aliased expr in explain [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [I] chore: Create CI action that validates links in md files [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] fix: UNIQUE constraint with NULLs incorrectly collapses GROUP BY groups [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] perf(aggregate): pin build phase to a single thread [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: add DROP pipe operator support [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/21
Re: [PR] Skip files outside partition structure in hive-partitioned listing tables [datafusion]
via GitHub
2026/04/21
[PR] Skip files outside partition structure in hive-partitioned listing tables [datafusion]
via GitHub
2026/04/21
[I] Hive-partitioned listing table crashes when root directory contains non-partitioned files [datafusion]
via GitHub
2026/04/21
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/21
Re: [PR] Add lambda support and array_transform udf [datafusion]
via GitHub
2026/04/20
[PR] feat: Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
2026/04/20
Re: [PR] Support Dictionary Arrays in MIN/MAX Aggregates [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [I] Reduce allocation churn in tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [I] plan_to_sql drops window expressions for Window(Aggregate) plans without Projection [datafusion]
via GitHub
2026/04/20
Re: [PR] fix: Validate spill read schema [datafusion]
via GitHub
2026/04/20
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
[PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [I] Binary string (`BYTEA`, `Binary`) concatenation [datafusion]
via GitHub
2026/04/20
Re: [PR] chore(deps): bump rand from 0.9.4 to 0.10.1 [datafusion-ballista]
via GitHub
2026/04/20
Re: [I] Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [PR] fix: array_concat widens container variant for mixed List/LargeList inputs [datafusion]
via GitHub
2026/04/20
Re: [I] array_concat fails with internal error on mixed List + LargeList inputs [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [I] Add `used` memory size to `FairSpillPool` [datafusion]
via GitHub
2026/04/20
[I] Add `used` memory size to `FairSpillPool` [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] fix: array_concat widens container variant for mixed List/LargeList inputs [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
2026/04/20
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/20
Re: [PR] added support for MapFromEntries [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] fix: render binary columns as hex in DataFrame::describe() [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
2026/04/20
Re: [PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 [datafusion-ballista]
via GitHub
2026/04/20
Re: [PR] chore(deps): bump mimalloc from 0.1.48 to 0.1.49 [datafusion-ballista]
via GitHub
2026/04/20
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/20
Re: [PR] Push down topk through join [datafusion]
via GitHub
2026/04/20
Re: [PR] feat: enable external reclaim for mem spillable df operators [datafusion]
via GitHub
2026/04/20
[PR] chore: Start 0.16 development [datafusion-comet]
via GitHub
2026/04/20
Re: [PR] fix: allow safe mixed Spark/Comet partial/final aggregate execution [datafusion-comet]
via GitHub
2026/04/20
[PR] fix: allow safe mixed Spark/Comet partial/final aggregate execution [datafusion-comet]
via GitHub
2026/04/20
[PR] chore(deps): bump mimalloc from 0.1.48 to 0.1.49 [datafusion-ballista]
via GitHub
2026/04/20
[PR] chore(deps): bump rand from 0.9.4 to 0.10.1 [datafusion-ballista]
via GitHub
2026/04/20
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init) [datafusion]
via GitHub
2026/04/20
[PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 [datafusion-ballista]
via GitHub
2026/04/20
Re: [PR] fix: report task output metrics in Spark UI [datafusion-comet]
via GitHub
2026/04/20
Re: [PR] fix: report task output metrics in Spark UI [datafusion-comet]
via GitHub