[GitHub] [lucene] gsmiller commented on pull request #12055: Better skipping for multi-term queries with a FILTER rewrite.

GitBox Mon, 02 Jan 2023 13:20:25 -0800


gsmiller commented on PR #12055:
URL: https://github.com/apache/lucene/pull/12055#issuecomment-1369212139


   +1 to this approach in general.
   
   I do wonder whether the distribution assumptions still hold once we start 
looking at "term in set" queries, though. That's sort of irrelevant right now 
since that implementation is still separate (`TermInSetQuery`), but this may 
add another reason to keep it separate going forward. I think the difference 
is that the terms in a "term in set" query may not follow natural language 
distributions in general, while the terms matched by the current 
`MultiTermQuery` implementations most likely do.
   
   I also wonder if we could be more aggressive with the number of clauses we 
build into a `BooleanQuery` if we leverage the short-circuiting idea in #11928. 
Might be a nice fit for this "filtering" case.
   
   Just a couple of thoughts, but certainly nothing blocking and nothing that 
needs to be included as part of this PR. Just wanted to toss them out there.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


