github
Thread
Date
Earlier messages
Later messages
Messages by Date
2026/04/03
Re: [PR] feat: Optimize TopK for single primitive column sorts [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: Optimize TopK for single primitive column sorts [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: Optimize TopK for single primitive column sorts [datafusion]
via GitHub
2026/04/03
[PR] feat: Optimize TopK for single primitive column sorts [datafusion]
via GitHub
2026/04/03
Re: [PR] DataFrame API: allow aggregate functions in select() (#17874) [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: Add Spark-compatible `encode` function to datafusion-spark [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: Add Spark-compatible `encode` function to datafusion-spark [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
[PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Add batch pass-through optimization to SortPreservingMergeExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] fix: binary string concat [datafusion]
via GitHub
2026/04/03
[PR] TEST - troubleshoot #21315 [datafusion]
via GitHub
2026/04/03
Re: [I] Add configurable UNION DISTINCT support to FILTER rewrite optimization [datafusion]
via GitHub
2026/04/03
Re: [I] Pluggable expression-level statistics estimation (ExpressionAnalyzer) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] perf: Optimize `split_part` for scalar args [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: Add immediate mode option for native shuffle [datafusion-comet]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
[PR] chore(deps): bump the all-other-cargo-deps group in /native with 2 updates [datafusion-comet]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
[PR] chore(deps): bump github/codeql-action from 4.34.1 to 4.35.1 [datafusion-comet]
via GitHub
2026/04/03
[PR] chore(deps): bump actions/github-script from 7 to 8 [datafusion-comet]
via GitHub
2026/04/03
Re: [I] Experiment: immediate-mode shuffle [datafusion-comet]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Eliminate more redundant `ProjectionExec`s [datafusion]
via GitHub
2026/04/03
Re: [PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] fix(spark): preserve raw number text in `json_tuple` to match Spark [datafusion]
via GitHub
2026/04/03
Re: [PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: remove full path from partition locations [datafusion-ballista]
via GitHub
2026/04/03
[PR] Preserve column order in projection embedding to eliminate redundant ProjectionExec [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] fix: raise AmbiguousReference error for duplicate column names in subquery [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: remove full path from partition locations [datafusion-ballista]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
[PR] Deduplicate non-inline StringView values in GroupValuesColumn [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [I] [Feature] Support external Remote Shuffle Service (e.g., Apache Celeborn / Apache Uniffle) [datafusion-ballista]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [I] [Feature] Support external Remote Shuffle Service (e.g., Apache Celeborn / Apache Uniffle) [datafusion-ballista]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
[PR] fix: parameterize file count in Native_datafusion metrics test [datafusion-comet]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/03
Re: [PR] fix: Native_datafusion reports correct files and bytes scanned [datafusion-comet]
via GitHub
2026/04/03
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
[PR] feat: Add Spark-compatible `encode` function to datafusion-spark [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: support LEAD and LAG window functions with IGNORE NULLS [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in SortPreservingMergeExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
[PR] Defer task spawning in RepartitionExec to first poll [datafusion]
via GitHub
2026/04/02
[I] Defer task spawning in SortPreservingMergeExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: audit array_insert expression for correctness and test coverage [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] Introduce Morselizer API [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: comet native scan improvements - Dynamic Partition Pruning [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in SortPreservingMergeExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Adds INList and Between expr to skip outer join [datafusion]
via GitHub
2026/04/02
Re: [I] EliminateOuterJoin does not recognize InList and Between as null-rejecting expressions [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in SortPreservingMergeExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in SortPreservingMergeExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] Defer task spawning in CoalescePartitionsExec to first poll [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: Add immediate mode option for native shuffle [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: unify ordering display with optimization path [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
2026/04/02
[PR] fix sqlite type range mismatch [datafusion-testing]
via GitHub
2026/04/02
Re: [PR] chore: Fix clippy and CI [datafusion]
via GitHub
2026/04/02
Re: [PR] fix: disable atan2 instead of tan [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] perf: Optimize `split_part` for scalar args [datafusion]
via GitHub
2026/04/02
Re: [I] Current shuffle format has too much overhead with default batch size [datafusion-comet]
via GitHub
2026/04/02
[PR] Fix Iceberg reflection for current() on TableOperations hierarchy [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: disable atan2 instead of tan [datafusion-comet]
via GitHub
2026/04/02
[I] `NoSuchMethodException` when reflecting Iceberg TableOperations.current() [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: handle ambiguous and non-existent local times [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: audit array_insert expression for correctness and test coverage [datafusion-comet]
via GitHub
2026/04/02
[I] [Feature] Support external Remote Shuffle Service (e.g., Apache Celeborn / Apache Uniffle) [datafusion-ballista]
via GitHub
2026/04/02
Re: [I] Current shuffle format has too much overhead with default batch size [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: disable atan2 instead of tan [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: disable atan2 instead of tan [datafusion-comet]
via GitHub
2026/04/02
[PR] add EmptySchemaShufflePartitioner and test from #3858 [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] perf: Optimize `split_part` for scalar args [datafusion]
via GitHub
2026/04/02
Re: [PR] perf: Optimize `split_part` for scalar args [datafusion]
via GitHub
2026/04/02
Re: [PR] perf: Optimize `split_part` for scalar args [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: support LEAD and LAG window functions with IGNORE NULLS [datafusion-comet]
via GitHub
2026/04/02
[PR] chor: enable Corr [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] docs: add custom table provider filter pushdown examples [datafusion]
via GitHub
2026/04/02
[PR] chore: add SQL tests for FIRST/LAST aggregates [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] deps: upgrade to DataFusion 53.0, Arrow to 58.1 [datafusion-comet]
via GitHub
2026/04/02
Re: [I] We do not respect ignoreNulls in first_value / last_value aggregates [datafusion-comet]
via GitHub
2026/04/02
Re: [I] CI: Add spark expression coverage to build process [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix: audit array_insert expression for correctness and test coverage [datafusion-comet]
via GitHub
2026/04/02
[PR] fix: audit array_insert expression for correctness and test coverage [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: enable native_datafusion scan in auto mode [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: enable native_datafusion scan in auto mode [datafusion-comet]
via GitHub
2026/04/02
[PR] Mark array_compact as Compatible and improve test coverage [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/04/02
Re: [PR] perf: Merge Precision in-place [datafusion]
via GitHub
2026/04/02
[I] Implement Spark-compatible array_distinct that preserves insertion order [datafusion-comet]
via GitHub
2026/04/02
[PR] test: improve array_distinct test coverage and incompatibility description [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/02
Re: [PR] feat: add audit-comet-expression Claude Code skill [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: add audit-comet-expression Claude Code skill [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/04/02
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/04/02
Re: [PR] deps: upgrade to DataFusion 53.0, Arrow to 58.1 [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: support LEAD and LAG window functions with IGNORE NULLS [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/02
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/02
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/02
Re: [PR] feat: support LEAD and LAG window functions with IGNORE NULLS [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] fix(spark): preserve raw number text in `json_tuple` to match Spark [datafusion]
via GitHub
2026/04/02
Re: [I] Add missing date/time functions (current_timestamp, date_format, make_time) [datafusion-python]
via GitHub
2026/04/02
Re: [PR] Add missing datetime functions [datafusion-python]
via GitHub
2026/04/02
Re: [PR] Merge queue: make dev checks required + add .asf.yaml validation [datafusion]
via GitHub
2026/04/02
Re: [PR] Merge queue: make dev checks required + add .asf.yaml validation [datafusion]
via GitHub
2026/04/02
Re: [PR] Merge queue: make dev checks required + add .asf.yaml validation [datafusion]
via GitHub
2026/04/02
Re: [PR] fix(spark): preserve raw number text in `json_tuple` to match Spark [datafusion]
via GitHub
2026/04/02
Re: [I] We do not respect ignoreNulls in first_value / last_value aggregates [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] chore: attach Diagnostic to unary operator type errors [datafusion]
via GitHub
2026/04/02
Re: [PR] chore: fix native shuffle for batches with no columns and 0 row count [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] feat: move shuffle writer disk I/O off tokio worker threads [datafusion-ballista]
via GitHub
2026/04/02
[I] Comet throws RuntimeException instead of SparkException for invalid row index column type [datafusion-comet]
via GitHub
2026/04/02
Re: [PR] perf: Merge Precision in-place [datafusion]
via GitHub
2026/04/02
Re: [PR] feat: sort file groups by statistics during sort pushdown (Sort pushdown phase 2) [datafusion]
via GitHub
Earlier messages
Later messages