jpountz commented on PR #12526:
URL: https://github.com/apache/lucene/pull/12526#issuecomment-1699741992

   I added a few tasks that I'm adding here for reference to see how it plays 
with disjunctions that have more terms or different document frequencies:
   
   ```
   OrHighVeryLow: 2005 mousehole # freq=835460 freq=123
   OrHighVeryLow: until motorboats # freq=425389 freq=128
   OrHighVeryLow: made monceau # freq=742313 freq=126
   OrHighVeryLow: do bush's # freq=511178 freq=2681
   OrHighVeryLow: 10 mikup # freq=918339 freq=119
   OrHighMedLow: international chris valois
   OrHighMedLow: right million universalist
   OrHighMedLow: known created forays
   OrHighMedLow: its universal bush's
   OrHighMedLow: 9 network racedetail.html
   OrHighHighHigh: 2005 until made
   OrHighHighHigh: do 10 international
   OrHighHighHigh: right known its
   OrHighHighHigh: until 10 known
   OrHighHighHigh: made international its
   OrHighMedMed: international chris million
   OrHighMedMed: right million created
   OrHighMedMed: known created universal
   OrHighMedMed: its universal network
   OrHighMedMed: 9 network chris
   OrHighHighLow: several following valois
   OrHighHighLow: publisher end universalist
   OrHighHighLow: 2009 film forays
   OrHighHighLow: http known bush's
   OrHighHighLow: south county racedetail.html
   OrHighHighMed: international right million
   OrHighHighMed: right known created
   OrHighMighMed: known its universal
   OrHighHighMed: its 9 network
   OrHighHighMed: 9 international chris
   ```
   
   ```
                               TaskQPS baseline      StdDevQPS 
my_modified_version      StdDev                Pct diff p-value
                       OrHighMedMed      158.53      (3.6%)      155.92      
(4.4%)   -1.7% (  -9% -    6%) 0.193
                     OrHighHighHigh       53.97      (5.0%)       53.13      
(4.9%)   -1.6% ( -10% -    8%) 0.324
                      OrHighHighMed      106.81      (4.0%)      105.37      
(4.3%)   -1.3% (  -9% -    7%) 0.306
                         OrHighHigh       64.42      (5.6%)       63.64      
(4.0%)   -1.2% ( -10% -    8%) 0.433
                      OrHighMighMed      201.12      (3.7%)      198.74      
(3.5%)   -1.2% (  -8% -    6%) 0.298
                       OrHighMedLow      323.10      (3.7%)      319.32      
(4.2%)   -1.2% (  -8% -    6%) 0.349
                          OrHighMed      227.13      (3.9%)      225.41      
(3.0%)   -0.8% (  -7% -    6%) 0.487
                           HighTerm      652.70      (4.2%)      659.51      
(5.3%)    1.0% (  -8% -   11%) 0.491
                           PKLookup      248.57      (3.4%)      251.38      
(1.9%)    1.1% (  -4% -    6%) 0.198
                            MedTerm     1060.67      (4.5%)     1076.33      
(5.4%)    1.5% (  -8% -   11%) 0.350
                            LowTerm     1639.65      (7.0%)     1667.48      
(4.9%)    1.7% (  -9% -   14%) 0.377
                      OrHighVeryLow      172.35      (8.2%)      196.54      
(8.4%)   14.0% (  -2% -   33%) 0.000
                      OrHighHighLow      449.76      (3.0%)      633.61      
(3.5%)   40.9% (  33% -   48%) 0.000
                          OrHighLow      546.08      (5.4%)     1187.98      
(5.1%)  117.5% ( 101% -  135%) 0.000
   ```
   
   While it tends to help queries that are already fast, it also helped 
OrHighVeryLow above, which is not among the fastest. I also like that none of 
the queries is getting a major slowdown.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to