github
Thread
Date
Earlier messages
Messages by Thread
[PR] test: add sql-file test confirming fallback on parquet variant reads [datafusion-comet]
via GitHub
[PR] Optimize Dictionary groupings [datafusion]
via GitHub
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
[I] Remove `native_iceberg_compat` scan [datafusion-comet]
via GitHub
[PR] Add support for nested types to nullif. [datafusion]
via GitHub
[I] Add support for nested types to `nullif`. [datafusion]
via GitHub
[PR] chore: Remove config option for `native_iceberg_compat` [datafusion-comet]
via GitHub
[PR] docs: Add documentation on Spark 4 limitations [datafusion-comet]
via GitHub
Re: [PR] docs: Add documentation on Spark 4 limitations [datafusion-comet]
via GitHub
Re: [I] [EPIC] Add support for Spark 4.0 [datafusion-comet]
via GitHub
Re: [PR] DataFrame API: allow aggregate functions in select() (#17874) [datafusion]
via GitHub
Re: [PR] DataFrame API: allow aggregate functions in select() (#17874) [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
Re: [PR] Introduce morsel-driven Parquet scan [datafusion]
via GitHub
[PR] Improve ergonomics for ExecutionPlanMetricsSet and MetricsSet [datafusion]
via GitHub
Re: [PR] Snowflake: Add support for text data type modifiers [datafusion-sqlparser-rs]
via GitHub
Re: [PR] PostgreSQL `UNLOGGED` Table Support and `ALTER TABLE ... SET LOGGED|UNLOGGED` [datafusion-sqlparser-rs]
via GitHub
[PR] fix: Fix local `datafusion-cli` test failure [datafusion]
via GitHub
Re: [PR] fix: Fix local `datafusion-cli` test failure [datafusion]
via GitHub
[PR] fix: substring with negative start index [datafusion-comet]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
[PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 in the all-other-cargo-deps group [datafusion]
via GitHub
Re: [PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 in the all-other-cargo-deps group [datafusion]
via GitHub
[PR] chore(deps): bump astral-sh/setup-uv from 8.0.0 to 8.1.0 [datafusion]
via GitHub
Re: [PR] chore(deps): bump astral-sh/setup-uv from 8.0.0 to 8.1.0 [datafusion]
via GitHub
[PR] chore(deps): bump github/codeql-action from 4.35.1 to 4.35.2 [datafusion]
via GitHub
Re: [PR] chore(deps): bump github/codeql-action from 4.35.1 to 4.35.2 [datafusion]
via GitHub
[PR] chore(deps): bump taiki-e/install-action from 2.75.10 to 2.75.18 [datafusion]
via GitHub
Re: [PR] chore(deps): bump taiki-e/install-action from 2.75.10 to 2.75.18 [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: statistics-driven TopK optimization for parquet (file reorder + RG reorder + threshold init + cumulative prune) [datafusion]
via GitHub
Re: [PR] feat: add DROP pipe operator support [datafusion]
via GitHub
[PR] Skip files outside partition structure in hive-partitioned listing tables [datafusion]
via GitHub
Re: [PR] Skip files outside partition structure in hive-partitioned listing tables [datafusion]
via GitHub
[I] Hive-partitioned listing table crashes when root directory contains non-partitioned files [datafusion]
via GitHub
[PR] feat: Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
Re: [PR] feat: Support `EXPLAIN ANALYZE` in Ballista [datafusion-ballista]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
Re: [PR] feat: estimate cardinality for semi and anti-joins using distinct counts [datafusion]
via GitHub
[PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
Re: [PR] perf(substr_index): speed up scalar and Utf8View [datafusion]
via GitHub
Re: [I] Expose used `MemoryPool` details in `ResourcesExhausted` error messages [datafusion]
via GitHub
[I] Add `used` memory size to `FairSpillPool` [datafusion]
via GitHub
Re: [I] Add `used` memory size to `FairSpillPool` [datafusion]
via GitHub
[PR] chore: Start 0.16 development [datafusion-comet]
via GitHub
Re: [PR] chore: Start 0.16 development [datafusion-comet]
via GitHub
[PR] fix: allow safe mixed Spark/Comet partial/final aggregate execution [datafusion-comet]
via GitHub
Re: [PR] fix: allow safe mixed Spark/Comet partial/final aggregate execution [datafusion-comet]
via GitHub
[PR] chore(deps): bump mimalloc from 0.1.48 to 0.1.49 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump mimalloc from 0.1.48 to 0.1.49 [datafusion-ballista]
via GitHub
[PR] chore(deps): bump rand from 0.9.4 to 0.10.1 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump rand from 0.9.4 to 0.10.1 [datafusion-ballista]
via GitHub
[PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 [datafusion-ballista]
via GitHub
Re: [PR] chore(deps): bump aws-config from 1.8.15 to 1.8.16 [datafusion-ballista]
via GitHub
[I] How to limit the maximum memory usage of the ballista executor [datafusion-ballista]
via GitHub
Re: [I] How to limit the maximum memory usage of the ballista executor? [datafusion-ballista]
via GitHub
Re: [PR] feat: Add support for `RegExpExtract`/`RegExpExtractAll` [datafusion-comet]
via GitHub
[I] Exchange reuse broken when CometExecRule converts BroadcastExchangeExec after ReuseExchangeAndSubquery [datafusion-comet]
via GitHub
Re: [I] Exchange reuse broken when CometExecRule converts BroadcastExchangeExec after ReuseExchangeAndSubquery [datafusion-comet]
via GitHub
Re: [PR] feat: [iceberg] Pass table master key ID to native scan [datafusion-comet]
via GitHub
Re: [PR] Timezone aware extract SQL expression [datafusion]
via GitHub
Re: [PR] Prefetch Row Groups using `next_reader` API in parquet-rs [datafusion]
via GitHub
[PR] test: run more Spark 4 tests [datafusion-comet]
via GitHub
[PR] chore: update compatibility guide for primitive to string casts [datafusion-comet]
via GitHub
Re: [PR] chore: update compatibility guide for primitive to string casts [datafusion-comet]
via GitHub
Re: [PR] chore: update compatibility guide for primitive to string casts [datafusion-comet]
via GitHub
[PR] feat: support non-AQE Dynamic Partition Pruning for CometNativeScanExec (Parquet V1) [datafusion-comet]
via GitHub
Re: [I] Return `NativeType` instead of `DataType` for `get_example_types` [datafusion]
via GitHub
[PR] perf: avoid JVM shuffle when sandwiched between non-Comet operators [datafusion-comet]
via GitHub
Re: [PR] perf: avoid JVM shuffle when sandwiched between non-Comet operators [datafusion-comet]
via GitHub
Re: [PR] perf: avoid JVM shuffle when sandwiched between non-Comet operators [WIP] [datafusion-comet]
via GitHub
Re: [PR] perf: avoid JVM shuffle when sandwiched between non-Comet operators [WIP] [datafusion-comet]
via GitHub
[PR] chore: complete coverage for primitive to string casts [datafusion-comet]
via GitHub
Re: [PR] chore: complete coverage for primitive to string casts [datafusion-comet]
via GitHub
Re: [PR] chore: complete coverage for primitive to string casts [datafusion-comet]
via GitHub
[PR] fix: cast to and from timestamp_ntz [datafusion-comet]
via GitHub
[I] Enable windowed aggregates and fix correctness issues [datafusion-comet]
via GitHub
[PR] fix: add explicit sort for window aggregates to fix correctness issues [datafusion-comet]
via GitHub
Re: [PR] fix: add explicit sort for window aggregates to fix correctness issues [datafusion-comet]
via GitHub
Re: [PR] fix: add explicit sort for window aggregates to fix correctness issues [datafusion-comet]
via GitHub
[PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Filter pushdown dynamic bytes morsels [datafusion]
via GitHub
Re: [PR] Dynamic filter scheduling during filter pushdown [datafusion]
via GitHub
Re: [PR] Dynamic filter scheduling during filter pushdown [datafusion]
via GitHub
Re: [PR] Dynamic filter scheduling during filter pushdown [datafusion]
via GitHub
Re: [PR] Dynamic filter scheduling during filter pushdown [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
Re: [PR] Adaptive filter scheduling for Parquet scans [datafusion]
via GitHub
[I] Reduce allocation churn in the optimizer [datafusion]
via GitHub
Re: [I] Reduce allocation churn in the optimizer [datafusion]
via GitHub
Re: [I] Reduce allocation churn in tree rewriting [datafusion]
via GitHub
[I] Add distinction between "info" and "fallback" messages [datafusion-comet]
via GitHub
Re: [PR] Blog: Row-Level DML in DataFusion [datafusion-site]
via GitHub
Re: [I] Integrate collect_set to Comet [datafusion-comet]
via GitHub
[I] [EPIC] Improve Comet planning [datafusion-comet]
via GitHub
Re: [I] [EPIC] Improve Comet planning [datafusion-comet]
via GitHub
[I] Avoid JVM shuffle when parent stage will just convert back to rows [datafusion-comet]
via GitHub
Re: [I] Reduce off-heap memory requirements TPC 1TB benchmarks [datafusion-comet]
via GitHub
Re: [I] Reduce off-heap memory requirements TPC 1TB benchmarks [datafusion-comet]
via GitHub
[PR] perf: Avoid `Box` and `Arc` allocation churn in the planner [datafusion]
via GitHub
Re: [PR] perf: Avoid `Box` and `Arc` allocation churn in the planner [datafusion]
via GitHub
Re: [PR] perf: Avoid `Box` and `Arc` allocation churn in the planner [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
Re: [PR] perf: Reduce `Box` and `Arc` allocation churn during tree rewriting [datafusion]
via GitHub
[PR] test: add tests for spill file sizes to verify View GC [datafusion]
via GitHub
Re: [PR] test: add tests for spill file sizes to verify View GC [datafusion]
via GitHub
Earlier messages