[jira] [Commented] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556673#comment-17556673 ] Adrien Grand commented on LUCENE-10624: I find these speedups surprising since I

[jira] [Comment Edited] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556662#comment-17556662 ] Weiming Wu edited comment on LUCENE-10624 at 6/21/22 6:16 AM:

[jira] [Updated] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiming Wu updated LUCENE-10624: Description: h3. Problem Statement We noticed DocValue read performance regression with the iter

[jira] [Commented] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556662#comment-17556662 ] Weiming Wu commented on LUCENE-10624: Added benchmark data to the content. > Bina

[jira] [Updated] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiming Wu updated LUCENE-10624: Attachment: candidate_sparseTaxis_searchsparse-sorted.0.log > Binary Search for Sparse IndexedDIS

[jira] [Updated] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiming Wu updated LUCENE-10624: Attachment: baseline_sparseTaxis_searchsparse-sorted.0.log > Binary Search for Sparse IndexedDISI

[jira] [Updated] (LUCENE-10624) Binary Search for Sparse IndexedDISI advanceWithinBlock & advanceExactWithinBlock

2022-06-20 Thread Weiming Wu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiming Wu updated LUCENE-10624: Status: Patch Available (was: Open) > Binary Search for Sparse IndexedDISI advanceWithinBlock &

[GitHub] [lucene] jtibshirani commented on pull request #951: LUCENE-10606: Optimize Prefilter Hit Collection

2022-06-20 Thread GitBox
jtibshirani commented on PR #951: URL: https://github.com/apache/lucene/pull/951#issuecomment-1160945431 @kaivalnp just wanted to check how this is going. I'm excited about this improvement. Let me know if I can help with anything, for example I could dig into the questions that Adrien and

[jira] [Comment Edited] (LUCENE-10557) Migrate to GitHub issue from Jira

2022-06-20 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556475#comment-17556475 ] Tomoko Uchida edited comment on LUCENE-10557 at 6/20/22 5:20 PM:

[jira] [Commented] (LUCENE-10557) Migrate to GitHub issue from Jira

2022-06-20 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556475#comment-17556475 ] Tomoko Uchida commented on LUCENE-10557: I browsed through several JSON dumps o

[GitHub] [lucene] LuXugang commented on pull request #967: LUCENE-10623: Error implementation of docValueCount for SortingSortedSetDocValues

2022-06-20 Thread GitBox
LuXugang commented on PR #967: URL: https://github.com/apache/lucene/pull/967#issuecomment-1160497729 > Thanks for catching this bug. The fix is a bit wasteful in that it requires iterating over ords twice, once to count them and another time to iterate through them. Maybe we should change

[jira] [Commented] (LUCENE-10557) Migrate to GitHub issue from Jira

2022-06-20 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556362#comment-17556362 ] Tomoko Uchida commented on LUCENE-10557: As for User ID alignment, it'd be grea

[jira] [Resolved] (LUCENE-10618) Implement BooleanQuery rewrite rules based for minimumShouldMatch

2022-06-20 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand resolved LUCENE-10618. Fix Version/s: 9.3 Resolution: Fixed Thanks [~joe hou]! > Implement BooleanQuery re

[jira] [Commented] (LUCENE-10618) Implement BooleanQuery rewrite rules based for minimumShouldMatch

2022-06-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556345#comment-17556345 ] ASF subversion and git services commented on LUCENE-10618: -- Co

[GitHub] [lucene] jpountz merged pull request #965: LUCENE-10618: Implement BooleanQuery rewrite rules based for minimumShouldMatch

2022-06-20 Thread GitBox
jpountz merged PR #965: URL: https://github.com/apache/lucene/pull/965

[jira] [Commented] (LUCENE-10557) Migrate to GitHub issue from Jira

2022-06-20 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556306#comment-17556306 ] Dawid Weiss commented on LUCENE-10557: I've verified that searches for old issue

[GitHub] [lucene] jpountz commented on pull request #967: LUCENE-10623: Error implementation of docValueCount for SortingSortedSetDocValues

2022-06-20 Thread GitBox
jpountz commented on PR #967: URL: https://github.com/apache/lucene/pull/967#issuecomment-1160204024 Thanks for catching this bug. The fix is a bit wasteful in that it requires iterating over ords twice, once to count them and another time to iterate through them. Maybe we should change `Do

[jira] [Commented] (LUCENE-10557) Migrate to GitHub issue from Jira

2022-06-20 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556288#comment-17556288 ] Tomoko Uchida commented on LUCENE-10557: > I've added a few bullet points that

[GitHub] [lucene] jpountz commented on pull request #964: LUCENE-10620: Pass the Weight to Collectors.

2022-06-20 Thread GitBox
jpountz commented on PR #964: URL: https://github.com/apache/lucene/pull/964#issuecomment-1160179472 Unfortunately this is challenging to do right at the moment since the API requires the collector to tell the `ScoreMode` it needs to be able to create the `Weight`. So either the collector s

[GitHub] [lucene] jpountz commented on pull request #964: LUCENE-10620: Pass the Weight to Collectors.

2022-06-20 Thread GitBox
jpountz commented on PR #964: URL: https://github.com/apache/lucene/pull/964#issuecomment-1160079281 Now when collectors need to count hits too (I changed IndexSearcher's `TOTAL_HITS_THRESHOLD` to `Integer.MAX_VALUE`): ``` TaskQPS baseline StdDevQPS