jasperjiaguo opened a new issue, #10500: URL: https://github.com/apache/pinot/issues/10500
- Current distinct(count) functions create in-memory sets. -- Increased chances of OOM -- Incurs gc pressure -- Cannot handle high cardinality -- Hard to utilize disk for spilling - Off-heap (direct buffer) hash table based solution can help here. - Can be extended by supporting spilling over to disk - Off heap hash-table can potentially be extended to group-by queries. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org