officialasishkumar opened a new pull request, #21455:
URL: https://github.com/apache/datafusion/pull/21455

   ## Which issue does this PR close?
   
   - Closes #20273.
   
   ## Rationale for this change
   
   `DataFrame::describe()` builds min/max aggregates for non-numeric columns 
and then casts the results to `Utf8` for display. That works for strings, but 
it fails for unsupported binary-like outputs such as `FixedSizeBinary`, which 
currently causes describe to error instead of falling back to null summary 
values.
   
   ## What changes are included in this PR?
   
   - Treat `FixedSizeBinary` as an unsupported min/max describe type, alongside 
the other binary-like types that cannot be rendered through the current `Utf8` 
cast path.
   - Reuse the same type predicate for both min and max summary construction.
   - Add a regression test covering `describe()` on a `FixedSizeBinary` column.
   
   ## Are these changes tested?
   
   - `cargo test -p datafusion --test core_integration 
describe_fixed_size_binary -- --nocapture`
   - `cargo test -p datafusion --test core_integration dataframe::describe:: -- 
--nocapture`
   
   ## Are there any user-facing changes?
   
   `describe()` no longer errors on `FixedSizeBinary` columns; unsupported 
min/max summaries now fall back to `null` as intended.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to