msokolov commented on issue #12439: URL: https://github.com/apache/lucene/issues/12439#issuecomment-1634804190
I wonder if docfreq of the terms should also be considered? EG perhaps only low-frequency terms should be used for pruning. Just thinking out loud, I have no idea how we'd do that. But are you able to look at statistics of avg/max docfreq across the terms in each query? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org