EmmyMiao87 commented on issue #5509:
URL: 
https://github.com/apache/incubator-doris/issues/5509#issuecomment-797417753


   The bitmap algorithm itself does not have very good computational 
performance when the cardinality is large and the distribution is loose. 
Performance tuning needs to be combined with modeling and real data 
distribution.
   1. Through the global dictionary, the value distribution of bitmap is 
changed from sparse to compact
   2. Observe your own data distribution to see if you can use the udaf 
orthogonal bitmap function
   How to use: 
http://doris.apache.org/master/zh-CN/extending-doris/udf/contrib/udaf-orthogonal-bitmap-manual.html
   
   By the way, I would like to recommend an article to you: A performance 
tuning case for small programs using bitmap to achieve precision marketing.
   https://blog.csdn.net/weixin_47452131/article/details/113393764
   
   bitmap算法本身在基数大且分布松散的情况下,计算性能不是很好。需要结合建模,真实数据分布进行性能调优。
   1. 通过全局字典的方式,使得bitmap的数值分布从稀疏改为紧凑
   2. 观察自己的数据分布,看是否能利用上udaf 正交bitmap函数
   使用方式: 
http://doris.apache.org/master/zh-CN/extending-doris/udf/contrib/udaf-orthogonal-bitmap-manual.html
   
   顺便给你推荐个文章:小程序使用 bitmap 实现精准营销的性能调优案例。
   https://blog.csdn.net/weixin_47452131/article/details/113393764


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to