kaivalnp opened a new pull request, #932: URL: https://github.com/apache/lucene/pull/932
### Description Link to [Jira](https://issues.apache.org/jira/browse/LUCENE-10559) ### Solution Added a `prefilter` and `filterSelectivity` argument to KnnGraphTester to be able to compare pre and post-filtering benchmarks `filterSelectivity` expresses the selectivity of a filter as proportion of passing docs that are randomly selected. We store these in a FixedBitSet and use this to calculate true KNN as well as in HNSW search In case of post-filter, we over-select results as `topK / filterSelectivity` to get final hits close to actual requested `topK` For pre-filter, we wrap the FixedBitSet in a query and pass it as prefilter argument to KnnVectorQuery -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org