dependabot[bot] opened a new pull request, #1523: URL: https://github.com/apache/datafusion-python/pull/1523
Bumps [arrow-schema](https://github.com/apache/arrow-rs) from 58.1.0 to 58.2.0. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/releases">arrow-schema's releases</a>.</em></p> <blockquote> <h2>arrow 58.2.0</h2> <!-- raw HTML omitted --> <h1>Changelog</h1> <h2><a href="https://github.com/apache/arrow-rs/tree/58.2.0">58.2.0</a> (2026-04-28)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/58.1.0...58.2.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Expose ColumnCloseResult on ArrowColumnChunk <a href="https://redirect.github.com/apache/arrow-rs/issues/9774">#9774</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Expose FFI data structures fields <a href="https://redirect.github.com/apache/arrow-rs/issues/9771">#9771</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>short-circuit last predicate in <code>RowFilter</code> when <code>with_limit(N)</code> is set <a href="https://redirect.github.com/apache/arrow-rs/issues/9765">#9765</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>vectorise dict-index bounds check <a href="https://redirect.github.com/apache/arrow-rs/issues/9747">#9747</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Refactor <code>RleEncoder::flush_bit_packed_run</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9734">#9734</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Add benchmark for cast from/to decimals <a href="https://redirect.github.com/apache/arrow-rs/issues/9728">#9728</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Add a security policy for arrow-rs <a href="https://redirect.github.com/apache/arrow-rs/issues/9727">#9727</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a 
href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>]</li> <li>Support <code>FixedSizeList</code> in arrow-json reader <a href="https://redirect.github.com/apache/arrow-rs/issues/9714">#9714</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add <code>VariantArrayBuilder::append_nulls</code> API <a href="https://redirect.github.com/apache/arrow-rs/issues/9684">#9684</a></li> <li>[Json] RunEndEncoded decoder optimization <a href="https://redirect.github.com/apache/arrow-rs/issues/9645">#9645</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] <code>variant_get(..., List<_>)</code> non-Struct types support <a href="https://redirect.github.com/apache/arrow-rs/issues/9615">#9615</a></li> <li>[Variant] Add unshredded <code>Struct</code> fast-path for <code>variant_get(..., Struct)</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9596">#9596</a></li> <li>Allow setting custom line terminator for CSV writer <a href="https://redirect.github.com/apache/arrow-rs/issues/9571">#9571</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Align cast logic for <code>variant_get</code> to cast kernel for numeric/bool types <a href="https://redirect.github.com/apache/arrow-rs/issues/9564">#9564</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ci: use ubuntu-slim where applicable <a href="https://redirect.github.com/apache/arrow-rs/issues/9536">#9536</a></li> <li>Publicly export <code>arrow_string::Predicate</code> and its methods? 
<a href="https://redirect.github.com/apache/arrow-rs/issues/9480">#9480</a></li> <li>Don't create CompressionContext when no compression is selected [IPC] <a href="https://redirect.github.com/apache/arrow-rs/issues/9463">#9463</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet: Raw level buffering causes unbounded memory growth for sparse columns <a href="https://redirect.github.com/apache/arrow-rs/issues/9446">#9446</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Parallel Parquet Reading <a href="https://redirect.github.com/apache/arrow-rs/issues/9381">#9381</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> </ul> <p><strong>Fixed bugs:</strong></p> <!-- raw HTML omitted --> </blockquote> <p>... (truncated)</p> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/apache/arrow-rs/blob/main/CHANGELOG.md">arrow-schema's changelog</a>.</em></p> <blockquote> <h2><a href="https://github.com/apache/arrow-rs/tree/58.2.0">58.2.0</a> (2026-04-28)</h2> <p><a href="https://github.com/apache/arrow-rs/compare/58.1.0...58.2.0">Full Changelog</a></p> <p><strong>Implemented enhancements:</strong></p> <ul> <li>Expose ColumnCloseResult on ArrowColumnChunk <a href="https://redirect.github.com/apache/arrow-rs/issues/9774">#9774</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Expose FFI data structures fields <a href="https://redirect.github.com/apache/arrow-rs/issues/9771">#9771</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>short-circuit last predicate in <code>RowFilter</code> when <code>with_limit(N)</code> is set <a href="https://redirect.github.com/apache/arrow-rs/issues/9765">#9765</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>vectorise dict-index bounds check <a 
href="https://redirect.github.com/apache/arrow-rs/issues/9747">#9747</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Refactor <code>RleEncoder::flush_bit_packed_run</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9734">#9734</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Add benchmark for cast from/to decimals <a href="https://redirect.github.com/apache/arrow-rs/issues/9728">#9728</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Add a security policy for arrow-rs <a href="https://redirect.github.com/apache/arrow-rs/issues/9727">#9727</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>]</li> <li>Support <code>FixedSizeList</code> in arrow-json reader <a href="https://redirect.github.com/apache/arrow-rs/issues/9714">#9714</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Add <code>VariantArrayBuilder::append_nulls</code> API <a href="https://redirect.github.com/apache/arrow-rs/issues/9684">#9684</a></li> <li>[Json] RunEndEncoded decoder optimization <a href="https://redirect.github.com/apache/arrow-rs/issues/9645">#9645</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] <code>variant_get(..., List<_>)</code> non-Struct types support <a href="https://redirect.github.com/apache/arrow-rs/issues/9615">#9615</a></li> <li>[Variant] Add unshredded <code>Struct</code> fast-path for <code>variant_get(..., Struct)</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9596">#9596</a></li> <li>Allow setting custom line terminator for CSV writer <a href="https://redirect.github.com/apache/arrow-rs/issues/9571">#9571</a> [<a 
href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>[Variant] Align cast logic for <code>variant_get</code> to cast kernel for numeric/bool types <a href="https://redirect.github.com/apache/arrow-rs/issues/9564">#9564</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>ci: use ubuntu-slim where applicable <a href="https://redirect.github.com/apache/arrow-rs/issues/9536">#9536</a></li> <li>Publicly export <code>arrow_string::Predicate</code> and its methods? <a href="https://redirect.github.com/apache/arrow-rs/issues/9480">#9480</a></li> <li>Don't create CompressionContext when no compression is selected [IPC] <a href="https://redirect.github.com/apache/arrow-rs/issues/9463">#9463</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet: Raw level buffering causes unbounded memory growth for sparse columns <a href="https://redirect.github.com/apache/arrow-rs/issues/9446">#9446</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>Parallel Parquet Reading <a href="https://redirect.github.com/apache/arrow-rs/issues/9381">#9381</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> </ul> <p><strong>Fixed bugs:</strong></p> <ul> <li>[Variant] <code>unshred_variant</code> panics on malformed bytes despite returning <code>Result</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9740">#9740</a></li> <li>RecordBatch::normalize() does not propagate top level null bitmap into the results <a href="https://redirect.github.com/apache/arrow-rs/issues/9732">#9732</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Incorrect accounting in <code>DictEncoder::estimated_memory_size</code> <a href="https://redirect.github.com/apache/arrow-rs/issues/9719">#9719</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>arrow-ipc writer does not comply with spec for 
empty variable-size arrays <a href="https://redirect.github.com/apache/arrow-rs/issues/9716">#9716</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Panic when reading corrupt parquet file with truncated data instead of ParquetError <a href="https://redirect.github.com/apache/arrow-rs/issues/9705">#9705</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> <li>NOTICE.txt is inaccurate <a href="https://redirect.github.com/apache/arrow-rs/issues/9703">#9703</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Unnecessary dependency on regex crate <a href="https://redirect.github.com/apache/arrow-rs/issues/9672">#9672</a></li> <li>[arrow-avro] Avro reader produces incorrect results when reader schema and writer schema differ <a href="https://redirect.github.com/apache/arrow-rs/issues/9655">#9655</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>parquet docs are broken on docs.rs <a href="https://redirect.github.com/apache/arrow-rs/issues/9649">#9649</a></li> <li>[Parquet] ArrowWriter with CDC panics on nested ListArrays <a href="https://redirect.github.com/apache/arrow-rs/issues/9637">#9637</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>]</li> <li>Use release KEYS file for verification instead of dev KEYS <a href="https://redirect.github.com/apache/arrow-rs/issues/9603">#9603</a></li> <li>IPC reader: handling of dictionaries with only null values <a href="https://redirect.github.com/apache/arrow-rs/issues/9595">#9595</a> [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>]</li> <li>Parquet RleDecoder::get_batch_with_dict panics on oob dictionary indices <a href="https://redirect.github.com/apache/arrow-rs/issues/9434">#9434</a> [<a 
href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>]</li> </ul> <p><strong>Documentation updates:</strong></p> <ul> <li>docs(variant): link VariantArray doc to official Parquet Variant extension type <a href="https://redirect.github.com/apache/arrow-rs/pull/9779">#9779</a> (<a href="https://github.com/mcharrel">mcharrel</a>)</li> <li>Document Security Policy <a href="https://redirect.github.com/apache/arrow-rs/pull/9730">#9730</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow">arrow</a>] [<a href="https://github.com/apache/arrow-rs/labels/arrow-flight">arrow-flight</a>] (<a href="https://github.com/alamb">alamb</a>)</li> <li>Docs: add example of how to read parquet row groups in parallel <a href="https://redirect.github.com/apache/arrow-rs/pull/9396">#9396</a> [<a href="https://github.com/apache/arrow-rs/labels/parquet">parquet</a>] (<a href="https://github.com/alamb">alamb</a>)</li> </ul> <p><strong>Performance improvements:</strong></p> <!-- raw HTML omitted --> </blockquote> <p>... 
(truncated)</p> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/apache/arrow-rs/commit/fc3f77896295df9cd72891cc4d8fdf6349244887"><code>fc3f778</code></a> Prepare for 58.2.0 release (<a href="https://redirect.github.com/apache/arrow-rs/issues/9800">#9800</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/3c4311c42971046a090308239111ad92181da13c"><code>3c4311c</code></a> Prevent buffer builder length overflow in <code>MutableBuffer::extend_zeros</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/9820">#9820</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/11f13a50a1abb483eaa77da24e90cdf0accee244"><code>11f13a5</code></a> feat(parquet): batch RLE runs in level encoder via scan-ahead (<a href="https://redirect.github.com/apache/arrow-rs/issues/9830">#9830</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/a3dbc154af4b9f6aebf38b4e5343fed176449a8e"><code>a3dbc15</code></a> feat: add <code>has_non_empty_nulls</code> helper function in <code>OffsetBuffer</code> (<a href="https://redirect.github.com/apache/arrow-rs/issues/9711">#9711</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/b00b5aa3bbe3aab8ea6595fdd99794a11c1ca730"><code>b00b5aa</code></a> fix(ipc): correct skip_field handling for V4 Union (<a href="https://redirect.github.com/apache/arrow-rs/issues/9829">#9829</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/b4c02d0e248c4550af1fa3229c6d7d05662f5326"><code>b4c02d0</code></a> support length() on Run-end encoding arrays (<a href="https://redirect.github.com/apache/arrow-rs/issues/9838">#9838</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/710e68e664ea2f17986d70e621634d76544fe2c6"><code>710e68e</code></a> Prevent ArrayData validation length overflow (<a href="https://redirect.github.com/apache/arrow-rs/issues/9816">#9816</a>)</li> <li><a 
href="https://github.com/apache/arrow-rs/commit/cb8d4c025b65ffa7fcc7bff73a1dee1004759417"><code>cb8d4c0</code></a> fix: correct accounting in <code>DictEncoder::estimated_memory_size</code>, `Interner::e...</li> <li><a href="https://github.com/apache/arrow-rs/commit/4fa8d2ff5f18f2d773f9642631715509f844a062"><code>4fa8d2f</code></a> fix(ipc): replace wildcard in skip_field with explicit DataType handling (<a href="https://redirect.github.com/apache/arrow-rs/issues/9822">#9822</a>)</li> <li><a href="https://github.com/apache/arrow-rs/commit/c4b2569276b5da0a57a18abaf3de6e1b04676c1b"><code>c4b2569</code></a> Replace <code>BooleanBufferBuilder</code> with <code>NullBufferBuilder</code> in arrow-json if appl...</li> <li>Additional commits viewable in <a href="https://github.com/apache/arrow-rs/compare/58.1.0...58.2.0">compare view</a></li> </ul> </details> <br /> Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. 
[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

</details>

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
