Jackie-Jiang commented on issue #9109: URL: https://github.com/apache/pinot/issues/9109#issuecomment-1197120271
We can enhance the current distinct count bitmap to support these asks:

> 1. store the query result as the bitmap (binary data type) for further aggregation.

Add `DistinctCountRawBitmapAggregationFunction`, which returns the serialized bitmap as the aggregation result. See `DistinctCountRawHLLAggregationFunction`, which was added for the same purpose, as an example.

> 2. calculate the cardinality of bitmap (stored as binary data type).

Trying to understand the ask here. Are you suggesting adding a new transform that takes a serialized bitmap and returns its cardinality? If my understanding is correct, it can be done by adding a new scalar function.

> 3. bitmap aggregation: merge two bitmaps into a new bitmap when we roll up the metric from the finest granularity to a higher granularity

If you are referring to the metric aggregation during real-time consumption, it can be a little tricky because the serialized bitmap is not fixed-size. We would need to redesign the metric aggregation mechanism to support storing variable-length results.

These are all very good asks! Do you want to help contribute them?
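
To make the three asks concrete, here is a minimal sketch of the operations involved: serializing a bitmap to bytes, computing cardinality from the serialized form, and merging two serialized bitmaps during a roll-up. It uses `java.util.BitSet` purely as a stand-in so that it runs with only the JDK; it is not Pinot's bitmap class, and the class and method names (`BitmapSketch`, `serialize`, `cardinality`, `merge`) are hypothetical, not existing Pinot APIs.

```java
import java.util.BitSet;

public class BitmapSketch {
    // Hypothetical helper: encode a set of non-negative value ids as a serialized bitmap.
    static byte[] serialize(int... ids) {
        BitSet bits = new BitSet();
        for (int id : ids) {
            bits.set(id);
        }
        return bits.toByteArray();
    }

    // Ask 2: what a scalar function over a serialized bitmap might compute.
    static int cardinality(byte[] serialized) {
        return BitSet.valueOf(serialized).cardinality();
    }

    // Ask 3: union two serialized bitmaps when rolling up to a coarser granularity.
    static byte[] merge(byte[] left, byte[] right) {
        BitSet merged = BitSet.valueOf(left);
        merged.or(BitSet.valueOf(right));
        return merged.toByteArray();
    }

    public static void main(String[] args) {
        byte[] minute1 = serialize(1, 5, 9);
        byte[] minute2 = serialize(5, 9, 200);
        byte[] hour = merge(minute1, minute2);

        // Distinct ids across both inputs are {1, 5, 9, 200}, so this prints 4.
        System.out.println(cardinality(hour));
        // The serialized size depends on the contents, which is the variable-length concern.
        System.out.println(minute1.length + " bytes vs " + hour.length + " bytes");
    }
}
```

The last line of `main` illustrates the point made about real-time metric aggregation: the merged result can be larger than either input, so a storage layout that assumes a fixed width per metric value cannot hold it without changes.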