comphead opened a new issue, #21771:
URL: https://github.com/apache/datafusion/issues/21771
### Is your feature request related to a problem or challenge?
Existing docs:
- docs/source/library-user-guide/query-optimizer.md — good high-level
overview of the optimizer architecture + how to write custom rules. Does NOT
list/describe individual built-in rules.
- datafusion/optimizer/README.md and
datafusion/physical-optimizer/README.md — brief crate-level READMEs, mostly
pointers to the above doc and blog posts.
- working-with-exprs.md and extending-operators.md — touch on optimizer
rules tangentially.
The gap: There is no reference doc that lists and describes what each
built-in rule actually does. The codebase has 27 logical optimizer rules (e.g.,
push_down_filter, eliminate_cross_join,
common_subexpr_eliminate) and 21 physical optimizer rules (e.g.,
join_selection, enforce_sorting, topk_aggregation), but the only way to learn
what each one does is to read the source or use EXPLAIN
VERBOSE.
So query-optimizer.md is good for "how the optimizer works" and "how to
extend it," but it's not enough if you want a reference of "what optimizations
does DataFusion apply and what does each one do."
That reference doesn't exist yet.
### Describe the solution you'd like
_No response_
### Describe alternatives you've considered
_No response_
### Additional context
_No response_
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]