jpountz commented on PR #964:
URL: https://github.com/apache/lucene/pull/964#issuecomment-1160079281
Here are the results when collectors also need to count hits (I changed
IndexSearcher's `TOTAL_HITS_THRESHOLD` to `Integer.MAX_VALUE`):
```
Task                         QPS baseline   StdDev QPS my_modified_version   StdDev Pct diff                 p-value
OrHighLow                           90.31   (6.7%)                   35.49   (1.8%)   -60.7% ( -64% -  -55%)   0.000
OrHighHigh                          40.98   (5.6%)                   21.17   (2.4%)   -48.3% ( -53% -  -42%)   0.000
OrHighNotLow                       143.61   (8.2%)                   76.04   (5.4%)   -47.1% ( -56% -  -36%)   0.000
OrHighNotMed                        88.77   (7.7%)                   49.41   (5.6%)   -44.3% ( -53% -  -33%)   0.000
OrHighNotHigh                       18.24   (7.4%)                   10.59   (5.9%)   -41.9% ( -51% -  -30%)   0.000
OrHighMed                           80.82   (5.0%)                   48.18   (2.8%)   -40.4% ( -45% -  -34%)   0.000
OrNotHighHigh                       51.35   (5.7%)                   39.11   (5.6%)   -23.8% ( -33% -  -13%)   0.000
AndHighHigh                         53.49   (1.9%)                   41.97   (4.1%)   -21.5% ( -27% -  -15%)   0.000
AndHighMed                         321.43   (2.4%)                  258.39   (4.5%)   -19.6% ( -25% -  -13%)   0.000
AndHighLow                        1777.06   (2.7%)                 1474.52   (3.1%)   -17.0% ( -22% -  -11%)   0.000
MedPhrase                          391.41   (5.9%)                  332.93   (5.1%)   -14.9% ( -24% -   -4%)   0.000
OrNotHighMed                       313.44   (6.7%)                  269.25   (5.3%)   -14.1% ( -24% -   -2%)   0.000
OrNotHighLow                      1977.65   (4.2%)                 1803.88   (4.7%)    -8.8% ( -16% -    0%)   0.000
AndHighHighDayTaxoFacets            25.28   (1.7%)                   23.30   (1.9%)    -7.8% ( -11% -   -4%)   0.000
MedTermDayTaxoFacets                79.97   (2.6%)                   74.42   (3.6%)    -6.9% ( -12% -    0%)   0.000
Prefix3                             27.72   (6.2%)                   25.83   (5.2%)    -6.8% ( -17% -    4%)   0.000
LowPhrase                          159.63   (5.0%)                  148.90   (3.3%)    -6.7% ( -14% -    1%)   0.000
OrHighMedDayTaxoFacets              19.30   (5.7%)                   18.11   (4.2%)    -6.2% ( -15% -    3%)   0.000
HighPhrase                          16.15   (5.7%)                   15.30   (4.6%)    -5.2% ( -14% -    5%)   0.001
Wildcard                            79.98   (2.3%)                   76.50   (3.0%)    -4.4% (  -9% -    1%)   0.000
AndHighMedDayTaxoFacets             72.60   (2.1%)                   69.79   (1.9%)    -3.9% (  -7% -    0%)   0.000
HighSpanNear                        44.71   (4.9%)                   43.00   (4.7%)    -3.8% ( -12% -    6%)   0.012
BrowseDayOfYearTaxoFacets           47.80   (2.0%)                   46.05  (12.3%)    -3.7% ( -17% -   10%)   0.189
Fuzzy2                             103.64   (2.1%)                  100.39   (1.7%)    -3.1% (  -6% -    0%)   0.000
BrowseDateTaxoFacets                46.13   (1.9%)                   44.69  (12.0%)    -3.1% ( -16% -   10%)   0.249
BrowseRandomLabelTaxoFacets         37.71   (2.1%)                   36.53  (10.5%)    -3.1% ( -15% -    9%)   0.195
MedSpanNear                         68.62   (3.0%)                   66.69   (3.0%)    -2.8% (  -8% -    3%)   0.003
LowSpanNear                         57.05   (3.0%)                   55.49   (2.8%)    -2.7% (  -8% -    3%)   0.003
BrowseMonthTaxoFacets               29.68   (7.3%)                   28.87  (12.8%)    -2.7% ( -21% -   18%)   0.410
Fuzzy1                             128.59   (2.2%)                  125.27   (1.8%)    -2.6% (  -6% -    1%)   0.000
LowIntervalsOrdered                219.66   (4.2%)                  216.18   (3.4%)    -1.6% (  -8% -    6%)   0.184
HighIntervalsOrdered                35.55   (5.7%)                   35.03   (4.3%)    -1.5% ( -10% -    9%)   0.361
HighSloppyPhrase                     8.33  (15.0%)                    8.22  (13.6%)    -1.3% ( -25% -   32%)   0.775
BrowseDayOfYearSSDVFacets           21.93   (9.9%)                   21.80   (9.3%)    -0.6% ( -17% -   20%)   0.841
BrowseMonthSSDVFacets               23.61   (8.5%)                   23.53   (8.2%)    -0.3% ( -15% -   17%)   0.904
Respell                             77.59   (2.0%)                   77.42   (2.3%)    -0.2% (  -4% -    4%)   0.740
BrowseRandomLabelSSDVFacets         15.20   (5.5%)                   15.19   (5.7%)    -0.1% ( -10% -   11%)   0.971
MedIntervalsOrdered                 43.08   (5.0%)                   43.14   (4.6%)     0.1% (  -9% -   10%)   0.934
LowSloppyPhrase                     54.27  (10.7%)                   54.76  (10.1%)     0.9% ( -17% -   24%)   0.782
BrowseDateSSDVFacets                 4.21  (12.4%)                    4.26  (11.8%)     1.0% ( -20% -   28%)   0.784
PKLookup                           240.85   (2.3%)                  244.63   (1.9%)     1.6% (  -2% -    5%)   0.018
MedSloppyPhrase                     15.08   (9.3%)                   15.47   (9.8%)     2.6% ( -15% -   24%)   0.384
TermDTSort                         104.10   (2.0%)                  108.13   (3.5%)     3.9% (  -1% -    9%)   0.000
HighTermDayOfYearSort              103.69   (2.0%)                  107.85   (3.5%)     4.0% (  -1% -    9%)   0.000
HighTermTitleBDVSort               106.26   (2.3%)                  112.05   (4.1%)     5.5% (   0% -   12%)   0.000
HighTermMonthSort                  210.13   (2.5%)                  224.43  (12.5%)     6.8% (  -7% -   22%)   0.017
LowTerm                            754.49  (16.4%)                 2902.39  (24.5%)   284.7% ( 209% -  389%)   0.000
MedTerm                            251.51   (3.4%)                 2585.44  (48.9%)   928.0% ( 846% - 1015%)   0.000
HighTerm                           124.40   (3.1%)                 1782.93  (74.1%)  1333.3% (1217% - 1456%)   0.000
IntNRQ                              12.42  (10.8%)                  308.10 (193.8%)  2380.8% (1963% - 2899%)   0.000
```
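For reading the table: the `Pct diff` column appears to be the relative change of the modified version's QPS over the baseline's QPS. The sketch below is only an illustration of that arithmetic; the class and method names are made up for this example and are not part of luceneutil, and results computed from the rounded QPS values shown above can differ from the table in the last digit.

```java
import java.util.Locale;

// Hypothetical helper for illustration only, not luceneutil code.
final class PctDiff {
    /** Relative QPS change of the candidate over the baseline, in percent. */
    static double pctDiff(double baselineQps, double candidateQps) {
        return (candidateQps - baselineQps) / baselineQps * 100.0;
    }

    public static void main(String[] args) {
        // OrHighLow row: baseline 90.31 QPS, modified version 35.49 QPS.
        System.out.println(String.format(Locale.ROOT, "%.1f%%", pctDiff(90.31, 35.49)));
        // LowTerm row: baseline 754.49 QPS, modified version 2902.39 QPS.
        System.out.println(String.format(Locale.ROOT, "%.1f%%", pctDiff(754.49, 2902.39)));
    }
}
```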
- IntNRQ and term queries benefit the most from this change because
`Weight#count` gives the hit count up front, which in turn enables skipping
non-competitive hits.
- Pure disjunctions suffer the most because BS1 (`BooleanScorer`) is no longer
used: the weight doesn't know whether hits will be skipped based on scores.
- Other scoring queries are impacted because they need to read impacts in
case the collector wants to skip hits based on scores.
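
The first point can be sketched in plain Java. This is not the code from the PR: `CountShortcutSketch` and `effectiveTotalHitsThreshold` are hypothetical names, and the only thing borrowed from Lucene is the convention that `Weight#count` returns -1 when the count cannot be computed cheaply. The idea is that once the count is known up front, the collector no longer has to visit hits just to count them, even with a threshold of `Integer.MAX_VALUE`.

```java
// Hypothetical sketch of the reasoning above, not Lucene or PR code.
final class CountShortcutSketch {
    /** Same convention as Weight#count: -1 means the count is not cheaply known. */
    static final int UNKNOWN_COUNT = -1;

    /**
     * Returns how many hits still have to be visited for counting purposes
     * before non-competitive hits may be skipped.
     */
    static int effectiveTotalHitsThreshold(int totalHitsThreshold, int upFrontCount) {
        if (upFrontCount != UNKNOWN_COUNT) {
            // The total is already known, so counting imposes no visiting requirement.
            return 0;
        }
        // Otherwise every hit up to the threshold must be visited to be counted.
        return totalHitsThreshold;
    }

    public static void main(String[] args) {
        // Count known up front (e.g. a term query): skipping can begin immediately.
        System.out.println(effectiveTotalHitsThreshold(Integer.MAX_VALUE, 12345));
        // Count unknown: the configured threshold still applies.
        System.out.println(effectiveTotalHitsThreshold(Integer.MAX_VALUE, UNKNOWN_COUNT));
    }
}
```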