mikemccand commented on issue #1351: LUCENE-9280: Collectors to skip 
noncompetitive documents
URL: https://github.com/apache/lucene-solr/pull/1351#issuecomment-615483497
 
 
   The `wikimedium1m` corpus is really too small to draw strong conclusions -- 
I would use it to run a quick performance test, e.g. to see that it can run to 
completion, not dying with an exception, but then run the real test on 
`wikimediumall`.
   
   Do you know why you are seeing these warnings?
   
   ```
   WARNING: cat=HighTermDayOfYearSort: hit counts differ: 541658 vs 541658+
   WARNING: cat=TermDTSort: hit counts differ: 68644 vs 68644+
   ```
   
   Oh, I guess this is expected, since the optimization may skip some 
non-competitive matches by skipping forward to find possibly competitive hits 
based on current PQ bottom?  But then, why is the warning not something more 
like:
   
   ```
   WARNING: cat=HighTermDayOfYearSort: hit counts differ: 541658 vs 5000+
   ```
   
   I.e., it looks like (for these benchmark queries anyways) the optimization 
did not wind up skipping any hits (though, it thought it may have, hence the 
added `+`)?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to