zacharymorn opened a new pull request #418:
URL: https://github.com/apache/lucene/pull/418


   # Description
   This PR is WIP for implementing basic dynamic pruning support for 
CombinedFieldsQuery
   
   # Tests
   
   Added a test to compare performance between top_score (with pruning) and 
complete scoring, and verify top 100 results match between the two. Here are 
the results:
   
   Query:
   ```
       CombinedFieldQuery query =
               new CombinedFieldQuery.Builder()
                       .addField("title", (float) 4.0)
                       .addField("body", (float) 2.0)
                       .addTerm(new BytesRef(termA))
                       .addTerm(new BytesRef(termB))
                       .build();
   ```
   
   Results:
   ```
   HighMed: from office # freq=3224339 freq=225338
   top scores time usage        425 milliseconds
   complete time usage          401 milliseconds
       
   HighHigh: but publisher # freq=1456553 freq=1289029
   top scores time usage        469 milliseconds
   complete time usage          322 milliseconds
       
   HighLow: with fung # freq=3709421 freq=1344
   top scores time usage        241 milliseconds
   complete time usage          428 milliseconds
       
   HighLow: date insult # freq=2020626 freq=4424
   top scores time usage        171 milliseconds
   complete time usage          225 milliseconds
    ```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to