kaivalnp opened a new pull request, #932:
URL: https://github.com/apache/lucene/pull/932

   ### Description
   Link to [Jira](https://issues.apache.org/jira/browse/LUCENE-10559)
   
   ### Solution
   
   Added a `prefilter` and `filterSelectivity` argument to KnnGraphTester to be 
able to compare pre and post-filtering benchmarks
   
   `filterSelectivity` expresses the selectivity of a filter as proportion of 
passing docs that are randomly selected. We store these in a FixedBitSet and 
use this to calculate true KNN as well as in HNSW search
   
   In case of post-filter, we over-select results as `topK / filterSelectivity` 
to get final hits close to actual requested `topK`
   For pre-filter, we wrap the FixedBitSet in a query and pass it as prefilter 
argument to KnnVectorQuery


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to