gf2121 commented on PR #14176:
URL: https://github.com/apache/lucene/pull/14176#issuecomment-2636202013

   **Some new progress**
   
   > Luceneutil now can load 3 implementors of IntersectVisitor: RangeQuery 
Visitor, RangeQuery InverseVisitor and DynamicPruning Visitor. Here is the 
result on wikimediumall and taskCountPerCat=5
                               TaskQPS baseline      StdDevQPS 
my_modified_version      StdDev                Pct diff p-value
                  TermDayOfYearSort      259.87      (3.9%)      269.26      
(4.2%)    3.6% (  -4% -   12%) 0.005
                CountFilteredIntNRQ       61.70      (7.1%)       85.00      
(2.0%)   37.8% (  26% -   50%) 0.000
                         TermDTSort      149.65      (6.2%)      232.85      
(9.6%)   55.6% (  37% -   76%) 0.000
                     FilteredIntNRQ       82.76     (10.0%)      135.48      
(3.7%)   63.7% (  45% -   85%) 0.000
                             IntNRQ       84.62     (10.5%)      139.05      
(2.6%)   64.3% (  46% -   86%) 0.000
                             
   The previous result was got by  `taskRepeatCount=20` .  I find that when i 
increase taskRepeatCount to 50, the speedup disappeared:
   
   ```
                               TaskQPS baseline      StdDevQPS 
my_modified_version      StdDev                Pct diff p-value
                  TermDayOfYearSort      196.21      (8.7%)      194.85     
(11.2%)   -0.7% ( -18% -   21%) 0.871
                CountFilteredIntNRQ       84.92     (13.1%)       84.84     
(12.1%)   -0.1% ( -22% -   28%) 0.987
                             IntNRQ      137.14     (20.2%)      137.30     
(18.4%)    0.1% ( -31% -   48%) 0.989
                     FilteredIntNRQ      134.41     (20.0%)      135.05     
(18.1%)    0.5% ( -31% -   48%) 0.954
                         TermDTSort      196.18      (9.0%)      201.19      
(9.0%)    2.6% ( -14% -   22%) 0.506
   ```
   
   When i introduce a new task running `PointInSetQuery` , the speedup appears 
stably when `taskRepeatCount=50` or `taskRepeatCount=100`:
   
   ```
                               TaskQPS baseline      StdDevQPS 
my_modified_version      StdDev                Pct diff p-value
                  TermDayOfYearSort      200.79      (8.3%)      200.43      
(8.5%)   -0.2% ( -15% -   18%) 0.947
                             IntSet     1358.83      (3.5%)     1364.40      
(3.6%)    0.4% (  -6% -    7%) 0.714
                         TermDTSort      200.55      (9.4%)      203.26      
(8.2%)    1.3% ( -14% -   20%) 0.630
                CountFilteredIntNRQ       61.05      (8.2%)       87.15      
(7.9%)   42.7% (  24% -   64%) 0.000
                     FilteredIntNRQ       82.13     (10.2%)      139.49      
(9.7%)   69.9% (  45% -   99%) 0.000
                             IntNRQ       83.22     (10.2%)      141.61      
(9.5%)   70.2% (  45% -  100%) 0.000
   ```
   
   The PR to introduce `PointInSetQuery` Task: 
https://github.com/mikemccand/luceneutil/pull/335. Maybe we should look into 
merge it before this PR.
   
   
   
   
                             
                             


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to