jfreden commented on PR #13036:
URL: https://github.com/apache/lucene/pull/13036#issuecomment-1916440041

   I added code to apply the optimization only if 
`count(term-with-less-docs)/count(term-with-more-docs) < 0.1`, and it yielded a 
much better result. I will investigate the term cache idea too, since 
`CountOrHighHigh` still shows a slowdown.
   ```
                               Task  QPS baseline      StdDev  QPS my_modified_version      StdDev                Pct diff p-value
                    CountOrHighHigh       43.48      (3.1%)       42.94      (3.1%)   -1.2% (  -7% -    5%) 0.201
                    CountAndHighMed      117.33      (1.6%)      116.68      (2.2%)   -0.6% (  -4% -    3%) 0.357
                   CountAndHighHigh       13.40      (1.4%)       13.34      (1.6%)   -0.5% (  -3% -    2%) 0.305
                     CountOrHighMed       44.02      (2.4%)       85.10      (4.3%)   93.3% (  84% -  102%) 0.000
   ```
   ```
                               Task  QPS baseline      StdDev  QPS my_modified_version      StdDev                Pct diff p-value
               HighIntervalsOrdered        2.45     (15.8%)        2.37     (16.8%)   -3.3% ( -30% -   34%) 0.526
               BrowseDateSSDVFacets        1.77      (9.7%)        1.71     (13.2%)   -3.1% ( -23% -   21%) 0.394
                             IntNRQ       26.64      (9.5%)       25.91     (13.2%)   -2.8% ( -23% -   22%) 0.447
                  HighTermMonthSort     4247.44      (5.4%)     4174.43      (6.8%)   -1.7% ( -13% -   11%) 0.379
                MedIntervalsOrdered        4.40      (9.0%)        4.34      (9.3%)   -1.5% ( -18% -   18%) 0.597
                       OrNotHighLow     1013.74      (2.8%)     1000.22      (4.0%)   -1.3% (  -7% -    5%) 0.223
                    CountOrHighHigh       43.48      (3.1%)       42.94      (3.1%)   -1.2% (  -7% -    5%) 0.201
                         OrHighHigh       45.23      (7.6%)       44.71      (7.1%)   -1.1% ( -14% -   14%) 0.624
                LowIntervalsOrdered       75.80      (6.3%)       74.94      (7.0%)   -1.1% ( -13% -   12%) 0.591
                          OrHighLow      490.67      (4.2%)      485.46      (3.7%)   -1.1% (  -8% -    7%) 0.394
                            LowTerm      782.27      (3.7%)      774.48      (3.3%)   -1.0% (  -7% -    6%) 0.366
                       OrNotHighMed      327.78      (2.4%)      325.10      (2.6%)   -0.8% (  -5% -    4%) 0.295
             OrHighMedDayTaxoFacets        7.49      (4.0%)        7.44      (4.3%)   -0.7% (  -8% -    7%) 0.574
                         AndHighLow     1052.61      (4.4%)     1044.91      (4.7%)   -0.7% (  -9% -    8%) 0.610
            AndHighMedDayTaxoFacets       31.62      (1.4%)       31.39      (2.3%)   -0.7% (  -4% -    3%) 0.245
                    MedSloppyPhrase        4.55      (2.2%)        4.52      (3.4%)   -0.6% (  -6% -    5%) 0.494
                   HighSloppyPhrase        7.84      (2.6%)        7.80      (3.2%)   -0.6% (  -6% -    5%) 0.535
                    CountAndHighMed      117.33      (1.6%)      116.68      (2.2%)   -0.6% (  -4% -    3%) 0.357
              HighTermDayOfYearSort      365.06      (3.2%)      363.03      (3.3%)   -0.6% (  -6% -    6%) 0.590
                           Wildcard       35.27      (2.2%)       35.08      (2.6%)   -0.6% (  -5% -    4%) 0.470
                         AndHighMed      149.03      (5.1%)      148.24      (4.7%)   -0.5% (  -9% -    9%) 0.736
                        MedSpanNear        4.61      (2.7%)        4.58      (3.3%)   -0.5% (  -6% -    5%) 0.588
                          MedPhrase       28.98      (4.8%)       28.83      (5.1%)   -0.5% (  -9% -    9%) 0.743
                   CountAndHighHigh       13.40      (1.4%)       13.34      (1.6%)   -0.5% (  -3% -    2%) 0.305
                         HighPhrase        8.65      (4.5%)        8.61      (5.4%)   -0.5% (  -9% -    9%) 0.765
                    LowSloppyPhrase       29.81      (2.4%)       29.70      (2.8%)   -0.4% (  -5% -    4%) 0.639
                      OrNotHighHigh      294.82      (3.6%)      293.71      (2.9%)   -0.4% (  -6% -    6%) 0.719
                            MedTerm      719.95      (6.3%)      717.27      (5.2%)   -0.4% ( -11% -   11%) 0.839
                            Respell       70.19      (2.4%)       69.94      (2.1%)   -0.4% (  -4% -    4%) 0.607
                           PKLookup      288.07      (2.7%)      287.18      (3.4%)   -0.3% (  -6% -    5%) 0.748
                            Prefix3      411.30      (1.9%)      410.09      (2.5%)   -0.3% (  -4% -    4%) 0.675
                          OrHighMed      226.84      (5.1%)      226.17      (4.0%)   -0.3% (  -8% -    9%) 0.842
                          LowPhrase       48.24      (4.1%)       48.15      (4.3%)   -0.2% (  -8% -    8%) 0.889
                      OrHighNotHigh      464.84      (3.1%)      464.02      (3.5%)   -0.2% (  -6% -    6%) 0.866
                             Fuzzy1      108.46      (1.9%)      108.40      (1.9%)   -0.1% (  -3% -    3%) 0.930
                         TermDTSort      290.54      (5.1%)      290.58      (5.5%)    0.0% ( -10% -   11%) 0.993
                       HighSpanNear       19.59      (2.1%)       19.60      (2.2%)    0.1% (  -4% -    4%) 0.916
                             Fuzzy2      107.38      (1.8%)      107.48      (2.3%)    0.1% (  -3% -    4%) 0.889
                       OrHighNotMed      452.75      (4.4%)      453.28      (4.4%)    0.1% (  -8% -    9%) 0.933
                       OrHighNotLow      384.90      (4.4%)      385.42      (5.1%)    0.1% (  -8% -   10%) 0.928
           AndHighHighDayTaxoFacets       18.84      (1.8%)       18.87      (2.1%)    0.1% (  -3% -    4%) 0.816
                        LowSpanNear       57.00      (3.1%)       57.11      (2.9%)    0.2% (  -5% -    6%) 0.841
                  HighTermTitleSort      244.47      (2.3%)      244.98      (3.3%)    0.2% (  -5% -    5%) 0.821
                           HighTerm      715.29      (6.2%)      717.58      (5.0%)    0.3% ( -10% -   12%) 0.857
                        AndHighHigh       61.54      (4.0%)       61.84      (3.6%)    0.5% (  -6% -    8%) 0.687
               MedTermDayTaxoFacets       38.45      (1.9%)       38.78      (2.3%)    0.9% (  -3% -    5%) 0.203
               HighTermTitleBDVSort       10.48      (4.6%)       10.63      (7.4%)    1.4% ( -10% -   14%) 0.466
        BrowseRandomLabelTaxoFacets        7.36      (3.7%)        7.48     (16.2%)    1.6% ( -17% -   22%) 0.675
              BrowseMonthSSDVFacets        6.64      (2.7%)        6.78      (9.2%)    2.1% (  -9% -   14%) 0.330
          BrowseDayOfYearTaxoFacets        7.77      (4.2%)        7.93     (14.0%)    2.2% ( -15% -   21%) 0.510
               BrowseDateTaxoFacets        7.68      (3.8%)        7.87     (15.0%)    2.5% ( -15% -   22%) 0.468
        BrowseRandomLabelSSDVFacets        6.05      (9.2%)        6.22     (11.2%)    2.8% ( -16% -   25%) 0.380
          BrowseDayOfYearSSDVFacets        6.69      (2.5%)        6.92      (8.8%)    3.3% (  -7% -   15%) 0.103
              BrowseMonthTaxoFacets       11.69     (24.5%)       12.24     (23.0%)    4.7% ( -34% -   69%) 0.534
                     CountOrHighMed       44.02      (2.4%)       85.10      (4.3%)   93.3% (  84% -  102%) 0.000
   ```
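
   For reference, the ratio check described at the top of this comment could 
look roughly like the sketch below. This is not the code from the PR: the class 
name `DocFreqRatioGuard`, the method `shouldApplyOptimization`, and the 
constant `MAX_RATIO` are hypothetical, and the sketch assumes the two per-term 
doc counts are already available as plain `long` values.

   ```java
   // Hypothetical sketch of the ratio guard; names are illustrative only
   // and are not taken from the PR.
   final class DocFreqRatioGuard {

       // Threshold quoted in the comment: apply the optimization only when
       // count(term-with-less-docs) / count(term-with-more-docs) < 0.1.
       static final double MAX_RATIO = 0.1;

       private DocFreqRatioGuard() {}

       /**
        * Returns true when the rarer term matches fewer than MAX_RATIO times
        * as many documents as the more common term.
        */
       static boolean shouldApplyOptimization(long docCountA, long docCountB) {
           long fewer = Math.min(docCountA, docCountB);
           long more = Math.max(docCountA, docCountB);
           if (more <= 0) {
               // Neither term matches any document, so the ratio is undefined.
               return false;
           }
           return (double) fewer / more < MAX_RATIO;
       }
   }
   ```

   With this sketch, `shouldApplyOptimization(50, 1000)` passes the guard (ratio 
0.05) while `shouldApplyOptimization(400, 1000)` does not (ratio 0.4). The 
argument order does not matter because the method takes the min and max itself.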



