[ 
https://issues.apache.org/jira/browse/LUCENE-9335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17339188#comment-17339188
 ] 

Adrien Grand commented on LUCENE-9335:
--------------------------------------

Thanks for writing two scorers to test this out! Would you be able to run 
queries under a profiler to see where your new scorers spend most of their 
time? This might help identify how we could make them faster.

Thanks also for testing with more queries. FWIW, adding just 4-5 new queries 
to the tasks file would be enough to play with the change. By the way, I'd 
be curious to see how your new scorers perform with 5 "Med" terms, which should 
be a worst-case scenario for BMW as all terms should have similar max scores. 
Since the queries you ran have a "Low" term, I wonder whether this term drives 
iteration, which would prevent BMM from showing the lower overhead it has 
compared to BMW.
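
To make the point about which term drives iteration concrete, here is a 
minimal sketch of MaxScore-style partitioning. It is a simplification, not the 
scorers attached to this issue nor Lucene's actual implementation. The class 
name, method name and all score values are hypothetical, and it assumes that a 
"Low" (rare) term has a noticeably higher max score than frequent terms.

```java
import java.util.Arrays;

/** Simplified MaxScore-style partitioning; hypothetical, not Lucene code. */
public class MaxScorePartition {

    /**
     * Returns how many terms are "non-essential": the longest prefix of terms,
     * sorted by ascending max score, whose max scores sum to less than the
     * minimum competitive score. A document matching only those terms cannot
     * be competitive, so only the remaining terms need to drive iteration.
     */
    static int countNonEssential(double[] maxScores, double minCompetitiveScore) {
        double[] sorted = maxScores.clone();
        Arrays.sort(sorted);
        double prefixSum = 0;
        int nonEssential = 0;
        for (double maxScore : sorted) {
            if (prefixSum + maxScore >= minCompetitiveScore) {
                break; // adding this term could produce a competitive hit
            }
            prefixSum += maxScore;
            nonEssential++;
        }
        return nonEssential;
    }

    public static void main(String[] args) {
        double minCompetitiveScore = 4.0; // made-up threshold

        // Three frequent terms plus one rare ("Low") term with a high max score.
        double[] withLowTerm = {1.0, 1.2, 1.5, 6.0};
        // Five "Med" terms with similar max scores.
        double[] fiveMedTerms = {2.0, 2.1, 2.2, 2.3, 2.4};

        System.out.println(countNonEssential(withLowTerm, minCompetitiveScore));  // 3
        System.out.println(countNonEssential(fiveMedTerms, minCompetitiveScore)); // 1
    }
}
```

With these made-up numbers, the query containing a "Low" term leaves only that 
term driving iteration, while the five "Med" terms leave four of the five 
terms driving iteration.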

> Add a bulk scorer for disjunctions that does dynamic pruning
> ------------------------------------------------------------
>
>                 Key: LUCENE-9335
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9335
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>         Attachments: wikimedium.10M.nostopwords.tasks
>
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> Lucene often gets benchmarked against other engines, e.g. against Tantivy and 
> PISA at [https://tantivy-search.github.io/bench/] or against research 
> prototypes in Table 1 of 
> [https://cs.uwaterloo.ca/~jimmylin/publications/Grand_etal_ECIR2020_preprint.pdf].
> Given that top-level disjunctions of term queries are commonly used for 
> benchmarking, it would be nice to optimize this case a bit more. I suspect 
> that we could make fewer per-document decisions by implementing a BulkScorer 
> instead of a Scorer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

