EmmyMiao87 opened a new issue #6370: URL: https://github.com/apache/incubator-doris/issues/6370
# Background

The only statistics Doris currently keeps are an inaccurate table row count and table size. Without more accurate column statistics, the Doris query planner cannot evaluate alternative plans and select a better one. The current statistics also lack a complete structure.

I propose adding a statistics collection module in the query layer. It would be mainly responsible for collecting, storing, and using statistics, so that the optimizer can produce better query plans.

# Requirements

1. Statistics include table stats (row count, data size) and column stats (ndv, min, max, null_nums, etc.)
2. Show statistics
3. Collect statistics (by sampling or over all data)
4. Store statistics (in memory and persisted)
5. Track whether statistics are up to date or out of date
6. Stay compatible with the old statistics logic
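
To make the shape of requirement 1 concrete, here is a minimal sketch of how table and column statistics might be modeled and collected. It is illustrative only and not the proposed Doris implementation: the names `ColumnStats`, `TableStats`, `collect_column_stats`, `collect_table_stats`, and the `sample_ratio` parameter are all hypothetical, and it is written in Python for brevity rather than in the language of the Doris query layer. It assumes rows are available as in-memory dictionaries.

```python
import random
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class ColumnStats:
    """Hypothetical per-column statistics: ndv, min, max, null count."""
    ndv: int                    # number of distinct non-null values
    min_value: Optional[Any]    # None when the column has no non-null values
    max_value: Optional[Any]
    null_nums: int


@dataclass
class TableStats:
    """Hypothetical per-table statistics plus a freshness flag."""
    row_count: int
    data_size: int              # bytes, supplied by the caller in this sketch
    columns: Dict[str, ColumnStats] = field(default_factory=dict)
    up_to_date: bool = True     # flip to False when the table changes


def collect_column_stats(values: List[Any]) -> ColumnStats:
    """Compute column statistics over the given values."""
    non_null = [v for v in values if v is not None]
    return ColumnStats(
        ndv=len(set(non_null)),
        min_value=min(non_null) if non_null else None,
        max_value=max(non_null) if non_null else None,
        null_nums=len(values) - len(non_null),
    )


def collect_table_stats(rows: List[Dict[str, Any]], data_size: int,
                        sample_ratio: float = 1.0, seed: int = 0) -> TableStats:
    """Collect statistics over all rows, or over a random sample of them.

    Sampled column statistics are reported as observed in the sample;
    this sketch does not extrapolate them to the full table.
    """
    scanned = rows
    if rows and sample_ratio < 1.0:
        k = max(1, int(len(rows) * sample_ratio))
        scanned = random.Random(seed).sample(rows, k)

    stats = TableStats(row_count=len(rows), data_size=data_size)
    if scanned:
        # Assumption: every row has the same keys as the first one.
        for name in scanned[0]:
            stats.columns[name] = collect_column_stats([r[name] for r in scanned])
    return stats


rows = [
    {"id": 1, "city": "a"},
    {"id": 2, "city": None},
    {"id": 2, "city": "b"},
]
full = collect_table_stats(rows, data_size=128)
sampled = collect_table_stats(rows, data_size=128, sample_ratio=0.5)
```

Here `full` scans every row, while `sampled` scans a random subset and therefore trades accuracy for collection cost, which is the choice requirement 3 describes. A real module would also need to persist these objects and invalidate them on data changes (requirements 4 and 5), which this sketch only hints at with the `up_to_date` flag.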