Messages by Thread
-
-
Re: [I] [EPIC] A collection of support for metadata columns in ListingTable [datafusion]
via GitHub
-
Re: [PR] feat: Add native support for scalar Math expressions [datafusion-comet]
via GitHub
-
Re: [I] Reduce Iceberg CI matrix: pin JDK per Spark version [datafusion-comet]
via GitHub
-
Re: [I] Skip defensive copy when unpacking dictionary arrays in UnpackOrClone mode [datafusion-comet]
via GitHub
-
Re: [I] Avoid unpacking dictionaries for inputs to SortExec [datafusion-comet]
via GitHub
-
[I] Preserve dictionary encoding through native expressions where possible [datafusion-comet]
via GitHub
-
Re: [I] Add with_virtual_columns to ParquetSource for reading virtual columns [datafusion]
via GitHub
-
[PR] fix: resolve Scala compiler warnings for auto-tupling and bare try [datafusion-comet]
via GitHub
-
[PR] Make Expr::alias and alias_qualified smarter by calling unalias [datafusion]
via GitHub
-
Re: [PR] Reduce cloning in LogicalPlanBuilder [datafusion]
via GitHub
-
[PR] test: skip flaky StateStoreSuite under Comet and disambiguate JDK matrix names [datafusion-comet]
via GitHub
-
Re: [PR] perf: reduce per-node allocations in to_native_metric_node [datafusion-comet]
via GitHub
-
Re: [I] Expr. simplification / rewrite: regex `.*foo.*` [datafusion]
via GitHub
-
Re: [PR] fix: UNIQUE constraint with NULLs incorrectly collapses GROUP BY groups [datafusion]
via GitHub
-
Re: [PR] Add configurable UNION DISTINCT to FILTER rewrite optimization [datafusion]
via GitHub
-
Re: [PR] Postgres regression 7b [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
-
Re: [PR] ci: add a CI job that builds without the lockfile [datafusion]
via GitHub
-
Re: [I] Test flake in `explain_analyze.slt` [datafusion]
via GitHub
-
Re: [PR] fix(substrait): normalize table names from Substrait NamedTable for Calcite interop [datafusion]
via GitHub
-
[PR] fix: JNI local reference cleanup in JVMClasses::with_env [datafusion-comet]
via GitHub
-
[I] Support higher-order array functions via JVM UDF bridge [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump ctor from 0.10.1 to 1.0.1 [datafusion]
via GitHub
-
[PR] chore(deps): bump the all-other-cargo-deps group with 2 updates [datafusion]
via GitHub
-
[PR] feat: implement array_exists with lambda support via JVM UDF bridge [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump the arrow-parquet group with 9 updates [datafusion]
via GitHub
-
[PR] chore(deps): bump github/codeql-action from 4.35.2 to 4.35.3 [datafusion]
via GitHub
-
[PR] chore(deps): bump taiki-e/install-action from 2.74.0 to 2.77.0 [datafusion]
via GitHub
-
Re: [PR] perf : experiment roaring bitmap for int32 anti and semi joins [datafusion]
via GitHub
-
[PR] fix: drop input plan early in `CoalescePartitionsExec` [datafusion]
via GitHub
-
Re: [PR] chore: Add existence (semi / anti ) benchmarks for hashjoinexec [datafusion]
via GitHub
-
[I] `CoalescePartitionsExec` delays cancellation of child operators [datafusion]
via GitHub
-
Re: [I] [EPIC] Make DataFusion the top of the ClickBench Parquet leaderboard [datafusion]
via GitHub
-
[PR] fix: support unhex on dictionary strings [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump tokio from 1.52.1 to 1.52.2 [datafusion-ballista]
via GitHub
-
[PR] chore(deps): bump ctor from 0.12.0 to 1.0.1 [datafusion-ballista]
via GitHub
-
[PR] feat: implement retract_batch for array_agg sliding window support [datafusion]
via GitHub
-
[PR] Support '0' value for parse_capacity_limit() [datafusion]
via GitHub
-
[I] Spark SQL `maintenance` test fails intermittently [datafusion-comet]
via GitHub
-
Re: [PR] feat: Support Spark expression hours_of_time [datafusion-comet]
via GitHub
-
Re: [PR] Add `rust-required-checks` [datafusion]
via GitHub
-
Re: [PR] implement `preimage` for date_trunc [datafusion]
via GitHub
-
Re: [PR] OptimizeProjections: safely prune struct-only UNNEST when outputs are unused [datafusion]
via GitHub
-
Re: [PR] feat: Improve `partition_statistics()` for `AggregateExec` using `distinct_count` [datafusion]
via GitHub
-
[PR] test: add INT96 TimestampNTZ correctness tests [datafusion-comet]
via GitHub
-
[I] native_datafusion more permissive than Spark 3.x when reading Parquet TimestampNTZ columns [datafusion-comet]
via GitHub
-
[PR] feat: add array_normalize scalar function [datafusion]
via GitHub
-
[D] [datafusion-spark] Add physical implementations for functions that only have simplify() [datafusion]
via GitHub
-
[I] Native DataFusion scan silently returns wrong values reading INT96 as TimestampNTZ [datafusion-comet]
via GitHub
-
[PR] deps: Bump OpenDAL to 0.56.0 [datafusion-comet]
via GitHub
-
[PR] feat: support Parquet field ID matching in native_datafusion scan [datafusion-comet]
via GitHub
-
Re: [I] [native_datafusion] Add support for reading row index metadata columns [datafusion-comet]
via GitHub
-
Re: [I] [DISCUSS] Representing Shared State / `ExecutionPlan::reset_state` [datafusion]
via GitHub
-
Re: [I] `lower`, `upper` could be further optimized for ASCII-only inputs [datafusion]
via GitHub
-
Re: [I] Support dict encoded structs in `get_field` [datafusion]
via GitHub
-
Re: [PR] feat: AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
-
[PR] feat: support AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
-
Re: [I] Write a wikipedia article for Apache DataFusion [datafusion]
via GitHub
-
Re: [I] Comet should fallback to Spark for streaming queries [datafusion-comet]
via GitHub
-
Re: [I] Unsupported aggregation mode PartialMerge [datafusion-comet]
via GitHub
-
Re: [PR] Add support for PostgreSQL's ORDER BY ... USING <operator> clause [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] test: extend SPARK-43402 plan-match to CometNativeScanExec and retag to #4042 [datafusion-comet]
via GitHub
-
[PR] fix: include per-column details in exportBatch row count mismatch error [datafusion-comet]
via GitHub
-
[PR] Map ProfileCredentialsProvider to profiel credential chain [datafusion-comet]
via GitHub
-
[I] Support AWS ProfileCredentialsProvider in native S3 object store [datafusion-comet]
via GitHub
-
[I] Number of rows in each column should be the same, but got [ArrayBuffer(8192, 0)] [datafusion-comet]
via GitHub
-
[PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
[PR] docs: document Spark version labels in bug triage guide [datafusion-comet]
via GitHub
-
[PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub