gf2121 commented on PR #14176: URL: https://github.com/apache/lucene/pull/14176#issuecomment-2636202013
**Some new progress**

> Luceneutil can now load 3 implementors of `IntersectVisitor`: RangeQuery Visitor, RangeQuery InverseVisitor and DynamicPruning Visitor. Here is the result on wikimediumall with taskCountPerCat=5:
>
> ```
>                 Task  QPS baseline   StdDev  QPS my_modified_version   StdDev              Pct diff  p-value
>    TermDayOfYearSort        259.87   (3.9%)                   269.26   (4.2%)    3.6% ( -4% -  12%)    0.005
>  CountFilteredIntNRQ         61.70   (7.1%)                    85.00   (2.0%)   37.8% ( 26% -  50%)    0.000
>           TermDTSort        149.65   (6.2%)                   232.85   (9.6%)   55.6% ( 37% -  76%)    0.000
>       FilteredIntNRQ         82.76  (10.0%)                   135.48   (3.7%)   63.7% ( 45% -  85%)    0.000
>               IntNRQ         84.62  (10.5%)                   139.05   (2.6%)   64.3% ( 46% -  86%)    0.000
> ```

The previous result was obtained with `taskRepeatCount=20`. When I increase `taskRepeatCount` to 50, the speedup disappears:

```
                Task  QPS baseline   StdDev  QPS my_modified_version   StdDev              Pct diff  p-value
   TermDayOfYearSort        196.21   (8.7%)                   194.85  (11.2%)   -0.7% (-18% -  21%)    0.871
 CountFilteredIntNRQ         84.92  (13.1%)                    84.84  (12.1%)   -0.1% (-22% -  28%)    0.987
              IntNRQ        137.14  (20.2%)                   137.30  (18.4%)    0.1% (-31% -  48%)    0.989
      FilteredIntNRQ        134.41  (20.0%)                   135.05  (18.1%)    0.5% (-31% -  48%)    0.954
          TermDTSort        196.18   (9.0%)                   201.19   (9.0%)    2.6% (-14% -  22%)    0.506
```

When I add a new task that runs `PointInSetQuery`, the speedup shows up consistently with `taskRepeatCount=50` or `taskRepeatCount=100`:

```
                Task  QPS baseline   StdDev  QPS my_modified_version   StdDev              Pct diff  p-value
   TermDayOfYearSort        200.79   (8.3%)                   200.43   (8.5%)   -0.2% (-15% -  18%)    0.947
              IntSet       1358.83   (3.5%)                  1364.40   (3.6%)    0.4% ( -6% -   7%)    0.714
          TermDTSort        200.55   (9.4%)                   203.26   (8.2%)    1.3% (-14% -  20%)    0.630
 CountFilteredIntNRQ         61.05   (8.2%)                    87.15   (7.9%)   42.7% ( 24% -  64%)    0.000
      FilteredIntNRQ         82.13  (10.2%)                   139.49   (9.7%)   69.9% ( 45% -  99%)    0.000
              IntNRQ         83.22  (10.2%)                   141.61   (9.5%)   70.2% ( 45% - 100%)    0.000
```

The PR that adds the `PointInSetQuery` task: https://github.com/mikemccand/luceneutil/pull/335. Maybe we should look into merging it before this PR.
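For anyone reading the tables: the `Pct diff` column appears to be the relative change of the mean QPS of `my_modified_version` over the baseline. The sketch below is only a sanity check under that assumption; the helper name `pct_diff` is hypothetical and is not part of luceneutil, and the numbers are copied from the `taskRepeatCount=20` table.

```python
def pct_diff(baseline_qps: float, candidate_qps: float) -> float:
    """Relative change of candidate over baseline, in percent (assumed formula)."""
    return 100.0 * (candidate_qps - baseline_qps) / baseline_qps


# (baseline mean QPS, my_modified_version mean QPS), copied from the
# taskRepeatCount=20 results reported in this comment.
rows = {
    "CountFilteredIntNRQ": (61.70, 85.00),
    "TermDTSort": (149.65, 232.85),
    "IntNRQ": (84.62, 139.05),
}

for task, (base, cand) in rows.items():
    # Print one line per task, rounded to one decimal like the table.
    print(f"{task:>20} {pct_diff(base, cand):6.1f}%")
```

The computed values match the reported 37.8%, 55.6% and 64.3% after rounding, which is consistent with the assumed formula. The range in parentheses and the p-value depend on the per-iteration samples and cannot be recomputed from the means alone.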