kishoreg commented on issue #7437:
URL: https://github.com/apache/pinot/issues/7437#issuecomment-924054486


   @amrishlal I suggest reading the design doc in detail again.
   
   The problem with any combination of approx filter and then scan to filter 
again can have bad edge cases. Moreover, the proposal from @richardstartin will 
completely eliminate the need for a linear scan. 
   
   - square root of N does not really lead to optimal distribution. Also, this 
might lead to a lot of buckets which will be really bad
   - binary searching of buckets will not make a huge difference when the 
number of buckets is low. Also, note that we need to find a list of buckets, 
not just one bucket. A range can span multiple buckets.
   -We cannot do binary search within the bucket since the posting lists are 
ordered by docIds and not values.
   
   Overall, I am strong +1 on this proposal. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to