jpountz commented on PR #12526: URL: https://github.com/apache/lucene/pull/12526#issuecomment-1699741992
I added a few tasks that I'm adding here for reference to see how it plays with disjunctions that have more terms or different document frequencies: ``` OrHighVeryLow: 2005 mousehole # freq=835460 freq=123 OrHighVeryLow: until motorboats # freq=425389 freq=128 OrHighVeryLow: made monceau # freq=742313 freq=126 OrHighVeryLow: do bush's # freq=511178 freq=2681 OrHighVeryLow: 10 mikup # freq=918339 freq=119 OrHighMedLow: international chris valois OrHighMedLow: right million universalist OrHighMedLow: known created forays OrHighMedLow: its universal bush's OrHighMedLow: 9 network racedetail.html OrHighHighHigh: 2005 until made OrHighHighHigh: do 10 international OrHighHighHigh: right known its OrHighHighHigh: until 10 known OrHighHighHigh: made international its OrHighMedMed: international chris million OrHighMedMed: right million created OrHighMedMed: known created universal OrHighMedMed: its universal network OrHighMedMed: 9 network chris OrHighHighLow: several following valois OrHighHighLow: publisher end universalist OrHighHighLow: 2009 film forays OrHighHighLow: http known bush's OrHighHighLow: south county racedetail.html OrHighHighMed: international right million OrHighHighMed: right known created OrHighMighMed: known its universal OrHighHighMed: its 9 network OrHighHighMed: 9 international chris ``` ``` TaskQPS baseline StdDevQPS my_modified_version StdDev Pct diff p-value OrHighMedMed 158.53 (3.6%) 155.92 (4.4%) -1.7% ( -9% - 6%) 0.193 OrHighHighHigh 53.97 (5.0%) 53.13 (4.9%) -1.6% ( -10% - 8%) 0.324 OrHighHighMed 106.81 (4.0%) 105.37 (4.3%) -1.3% ( -9% - 7%) 0.306 OrHighHigh 64.42 (5.6%) 63.64 (4.0%) -1.2% ( -10% - 8%) 0.433 OrHighMighMed 201.12 (3.7%) 198.74 (3.5%) -1.2% ( -8% - 6%) 0.298 OrHighMedLow 323.10 (3.7%) 319.32 (4.2%) -1.2% ( -8% - 6%) 0.349 OrHighMed 227.13 (3.9%) 225.41 (3.0%) -0.8% ( -7% - 6%) 0.487 HighTerm 652.70 (4.2%) 659.51 (5.3%) 1.0% ( -8% - 11%) 0.491 PKLookup 248.57 (3.4%) 251.38 (1.9%) 1.1% ( -4% - 6%) 0.198 MedTerm 1060.67 (4.5%) 1076.33 (5.4%) 1.5% ( -8% - 11%) 0.350 LowTerm 1639.65 (7.0%) 1667.48 (4.9%) 1.7% ( -9% - 14%) 0.377 OrHighVeryLow 172.35 (8.2%) 196.54 (8.4%) 14.0% ( -2% - 33%) 0.000 OrHighHighLow 449.76 (3.0%) 633.61 (3.5%) 40.9% ( 33% - 48%) 0.000 OrHighLow 546.08 (5.4%) 1187.98 (5.1%) 117.5% ( 101% - 135%) 0.000 ``` While it tends to help queries that are already fast, it also helped OrHighVeryLow above, which is not among the fastest. I also like that none of the queries is getting a major slowdown. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org