msokolov commented on PR #12434:
URL: https://github.com/apache/lucene/pull/12434#issuecomment-1657264032

   @jpountz I think I see what you mean re: the cost of evaluating filters. We 
have been working on a solution where the most common and restrictive filters 
are cached and can be applied as pre-filters without needing to recompute for 
every query, while also joining with other constraints (post-filtering). This 
has led to increased yield (more high-scoring matches) at acceptable cost 
increase in many cases but in some cases we see increased costs without much 
better yield (as compared to applying all the filters as post-filters). So I 
feel like if the parent doc filter is highly restrictive there could be a 
similar tradeoff in this case ie even when the cost of the parent doc filter 
computation is not a consideration you might still see better recall/latency 
tradeoff using overcollection/post-filtering. I don't think this has any 
implications for this change really, but I do think it would be interesting to 
see some evaluation comparing.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to