jpountz commented on PR #972:
URL: https://github.com/apache/lucene/pull/972#issuecomment-1167025504

   > I feel the effect would be similar?
   
   Indeed, sorry I had misread your code!
   
   > In terms of next steps, I'm wondering if there's a preference between bulk 
scorer and scorer implementations when performance improvement is similar
   
   No, it shouldn't matter. Bulk scorers sometimes yield better 
performance because it's easier for them to amortize computation across docs, 
but when they don't, there's no point in using a bulk scorer instead of a 
regular scorer.
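
   As a toy illustration of what "amortize computation across docs" can mean, here is a minimal Java sketch. It does not use Lucene's `Scorer` or `BulkScorer` APIs: the class name, the fixed block size, and the per-block score upper bounds are all assumptions made up for the example. The doc-at-a-time loop consults the upper bound for every doc, while the bulk loop consults it once per block and skips non-competitive blocks entirely.

```java
// Toy sketch only: hypothetical names, not Lucene's Scorer/BulkScorer API.
final class AmortizationSketch {
    // Assumption: docs are grouped in fixed-size blocks, each with a known score upper bound.
    static final int BLOCK = 4;

    // Counts how many times the (assumed costly) upper-bound lookup runs.
    int lookups = 0;

    float blockUpperBound(float[] blockMax, int doc) {
        lookups++;
        return blockMax[doc / BLOCK];
    }

    // Doc-at-a-time: the upper bound is consulted once per doc.
    int countCompetitiveDocAtATime(float[] scores, float[] blockMax, float minScore) {
        int hits = 0;
        for (int doc = 0; doc < scores.length; doc++) {
            if (blockUpperBound(blockMax, doc) < minScore) {
                continue; // this doc cannot be competitive
            }
            if (scores[doc] >= minScore) {
                hits++;
            }
        }
        return hits;
    }

    // Bulk: the upper bound is consulted once per block, so its cost is
    // amortized across all docs of that block.
    int countCompetitiveBulk(float[] scores, float[] blockMax, float minScore) {
        int hits = 0;
        for (int start = 0; start < scores.length; start += BLOCK) {
            if (blockUpperBound(blockMax, start) < minScore) {
                continue; // skip the whole block
            }
            int end = Math.min(start + BLOCK, scores.length);
            for (int doc = start; doc < end; doc++) {
                if (scores[doc] >= minScore) {
                    hits++;
                }
            }
        }
        return hits;
    }
}
```

   Both loops return the same hit count; only the number of upper-bound lookups differs. When that lookup is cheap, the two approaches would be expected to perform about the same.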
   
   I agree that it looks like a great speedup; we should get this in! The 
benchmark only tests performance of top-level disjunctions of term queries that 
have two clauses. I'd be curious to see performance numbers for queries like 
the ones below, to check whether we need to fine-tune when this new scorer 
gets used. Note that I don't think we need to improve performance for all of 
these queries to merge the change: we could start by only using this new 
scorer for the (common) case of a top-level disjunction of 2 term queries, and 
later see if this scorer can handle more kinds of disjunctions.
   
   ```
   OrAndHigMedAndHighMed: (+including +looking) (+date +finished) # disjunction of conjunctions, which don't have as good score upper bounds as term queries
   OrHighPhraseHighPhrase: "united states" "new york" # disjunction of phrase queries, which don't have as good score upper bounds as term queries and are slow to advance
   AndHighOrMedMed: +be +(mostly interview) # disjunction within conjunction that leads iteration
   AndMedOrHighHigh: +interview +(at united) # disjunction within conjunction that doesn't lead iteration
   ```
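
The suggested starting point (only use the new scorer for a top-level disjunction of 2 term queries) amounts to a shape check on the query. The Java sketch below shows one way such a check could look. It uses a simplified, hypothetical query model rather than Lucene's `BooleanQuery`, `TermQuery`, or `PhraseQuery` classes. All class and method names, and the terms of the two-term example, are assumptions for illustration, not the PR's actual code.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical, simplified query model: these are NOT Lucene classes.
final class QueryShapes {
    enum Occur { MUST, SHOULD }

    interface Query {}

    static final class Term implements Query {
        final String text;
        Term(String text) { this.text = text; }
    }

    static final class Phrase implements Query {
        final List<String> terms;
        Phrase(String... terms) { this.terms = Arrays.asList(terms); }
    }

    static final class Clause {
        final Occur occur;
        final Query query;
        Clause(Occur occur, Query query) { this.occur = occur; this.query = query; }
    }

    static final class Bool implements Query {
        final List<Clause> clauses;
        Bool(Clause... clauses) { this.clauses = Arrays.asList(clauses); }
    }

    static Clause must(Query q) { return new Clause(Occur.MUST, q); }
    static Clause should(Query q) { return new Clause(Occur.SHOULD, q); }
    static Term term(String text) { return new Term(text); }

    /** True only for a top-level disjunction of exactly two term queries. */
    static boolean isTopLevelTwoTermDisjunction(Query top) {
        if (!(top instanceof Bool)) {
            return false;
        }
        List<Clause> clauses = ((Bool) top).clauses;
        if (clauses.size() != 2) {
            return false;
        }
        for (Clause c : clauses) {
            if (c.occur != Occur.SHOULD || !(c.query instanceof Term)) {
                return false;
            }
        }
        return true;
    }

    // Two-term disjunction; the terms are arbitrary examples.
    static Query twoTermDisjunction() {
        return new Bool(should(term("united")), should(term("states")));
    }

    // (+including +looking) (+date +finished)
    static Query disjunctionOfConjunctions() {
        return new Bool(
            should(new Bool(must(term("including")), must(term("looking")))),
            should(new Bool(must(term("date")), must(term("finished")))));
    }

    // "united states" "new york"
    static Query disjunctionOfPhrases() {
        return new Bool(
            should(new Phrase("united", "states")),
            should(new Phrase("new", "york")));
    }

    // +be +(mostly interview)
    static Query disjunctionWithinConjunction() {
        return new Bool(
            must(term("be")),
            must(new Bool(should(term("mostly")), should(term("interview")))));
    }
}
```

With this check, only the two-term disjunction qualifies. The disjunction of conjunctions, the disjunction of phrases, and the disjunction nested in a conjunction would all stay on the existing scorers until benchmark numbers justify widening the condition.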


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

