kishoreg commented on issue #7437: URL: https://github.com/apache/pinot/issues/7437#issuecomment-924054486
@amrishlal I suggest reading the design doc in detail again. The problem with any combination of approx filter and then scan to filter again can have bad edge cases. Moreover, the proposal from @richardstartin will completely eliminate the need for a linear scan. - square root of N does not really lead to optimal distribution. Also, this might lead to a lot of buckets which will be really bad - binary searching of buckets will not make a huge difference when the number of buckets is low. Also, note that we need to find a list of buckets, not just one bucket. A range can span multiple buckets. -We cannot do binary search within the bucket since the posting lists are ordered by docIds and not values. Overall, I am strong +1 on this proposal. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org