[ 
https://issues.apache.org/jira/browse/LUCENE-9237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041329#comment-17041329
 ] 

Bruno Roustant commented on LUCENE-9237:
----------------------------------------

Luceneutil benchmark on wikimediumall, Lucene84 compared to UniformSplit:

TaskQPS Lucene84 StdDevQPS UniformSplit2 StdDev Pct diff
 Respell 42.88 (3.4%) 23.13 (1.4%) -46.1% ( -49% - -42%)
 Fuzzy2 50.71 (5.2%) 28.74 (2.1%) -43.3% ( -48% - -37%)
 Fuzzy1 51.41 (4.9%) 31.95 (2.3%) -37.9% ( -42% - -32%)
 Wildcard 48.02 (4.8%) 44.21 (3.2%) -7.9% ( -15% - 0%)
 HighTermMonthSort 74.90 (11.5%) 72.16 (14.0%) -3.7% ( -26% - 24%)
 IntNRQ 25.68 (18.8%) 25.15 (20.0%) -2.1% ( -34% - 45%)
 OrHighNotMed 632.99 (4.5%) 628.19 (4.4%) -0.8% ( -9% - 8%)
 HighTermDayOfYearSort 35.52 (7.3%) 35.33 (8.4%) -0.5% ( -15% - 16%)
BrowseDayOfYearTaxoFacets 1.00 (2.0%) 0.99 (1.3%) -0.4% ( -3% - 2%)
 HighSpanNear 5.65 (2.5%) 5.63 (2.7%) -0.3% ( -5% - 5%)
 BrowseDateTaxoFacets 1.00 (2.1%) 1.00 (1.5%) -0.1% ( -3% - 3%)
 MedSloppyPhrase 29.04 (3.3%) 29.02 (3.0%) -0.0% ( -6% - 6%)
 OrHighNotLow 646.22 (5.7%) 646.28 (5.4%) 0.0% ( -10% - 11%)
 LowSpanNear 12.58 (2.5%) 12.58 (2.5%) 0.0% ( -4% - 5%)
BrowseDayOfYearSSDVFacets 3.92 (1.9%) 3.93 (1.5%) 0.1% ( -3% - 3%)
 HighIntervalsOrdered 4.89 (2.5%) 4.89 (2.1%) 0.1% ( -4% - 4%)
 BrowseMonthSSDVFacets 4.39 (2.2%) 4.39 (1.4%) 0.2% ( -3% - 3%)
 MedSpanNear 10.31 (2.9%) 10.33 (3.2%) 0.2% ( -5% - 6%)
 LowSloppyPhrase 5.62 (3.1%) 5.63 (3.0%) 0.2% ( -5% - 6%)
 HighSloppyPhrase 30.20 (4.5%) 30.38 (4.4%) 0.6% ( -7% - 9%)
 LowPhrase 44.43 (3.1%) 44.79 (1.8%) 0.8% ( -3% - 5%)
 OrHighHigh 19.02 (3.2%) 19.19 (2.2%) 0.9% ( -4% - 6%)
 OrHighMed 40.58 (2.8%) 40.98 (2.5%) 1.0% ( -4% - 6%)
 BrowseMonthTaxoFacets 1.06 (2.1%) 1.08 (1.5%) 1.2% ( -2% - 4%)
 OrNotHighHigh 568.16 (4.0%) 575.63 (6.2%) 1.3% ( -8% - 11%)
 AndHighMed 77.35 (3.1%) 78.44 (3.5%) 1.4% ( -4% - 8%)
 OrHighNotHigh 559.98 (5.7%) 568.13 (5.8%) 1.5% ( -9% - 13%)
 PKLookup 176.23 (4.4%) 180.43 (5.2%) 2.4% ( -6% - 12%)
 OrNotHighLow 407.60 (5.3%) 418.01 (3.3%) 2.6% ( -5% - 11%)
 HighTerm 1407.02 (6.8%) 1447.11 (5.5%) 2.8% ( -8% - 16%)
 AndHighHigh 135.64 (3.7%) 139.81 (3.7%) 3.1% ( -4% - 10%)
 AndHighLow 418.33 (3.0%) 432.67 (4.9%) 3.4% ( -4% - 11%)
 MedPhrase 368.04 (4.5%) 382.66 (5.1%) 4.0% ( -5% - 14%)
 OrNotHighMed 567.06 (5.0%) 589.79 (5.6%) 4.0% ( -6% - 15%)
 MedTerm 1106.66 (5.9%) 1155.97 (3.7%) 4.5% ( -4% - 14%)
 OrHighLow 453.06 (6.1%) 474.49 (5.2%) 4.7% ( -6% - 16%)
 Prefix3 49.10 (17.7%) 51.44 (17.6%) 4.8% ( -20% - 44%)
 HighPhrase 296.12 (4.8%) 310.31 (4.3%) 4.8% ( -4% - 14%)
 LowTerm 1465.12 (6.1%) 1544.04 (5.5%) 5.4% ( -5% - 18%)

> Faster TermsEnum intersect for UniformSplit
> -------------------------------------------
>
>                 Key: LUCENE-9237
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9237
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Bruno Roustant
>            Assignee: Bruno Roustant
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> New version of TermsEnum intersect for UniformSplit. It is 70% more efficient 
> than the previous version for FuzzyQuery.
> Compared to BlockTree IntersectTermsEnum:
> - It is still slower for FuzzyQuery (-40%) but it is faster than the previous 
> version (which was -65%).
> - It is roughly same speed for WildcardQuery (-5%).
> - It is slightly faster for PrefixQuery (+5%). Sometimes benchmarks show more 
> improvement (I've seen up to +17% a third of the time).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to