Re: [PR] Merge DocAndFreqBuffer and DocAndScoreBuffer. [lucene]

2025-06-02 Thread via GitHub
jpountz merged PR #14748: URL: https://github.com/apache/lucene/pull/14748 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apa

Re: [PR] Merge CombinedFieldQuery's WeightedDisiWrapper into DisiWrapper. [lucene]

2025-06-02 Thread via GitHub
jpountz merged PR #14747: URL: https://github.com/apache/lucene/pull/14747 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apa

Re: [PR] Add code comment about use of Collections#synchronizedMap in LRUQueryCache. [lucene]

2025-06-02 Thread via GitHub
jpountz merged PR #14749: URL: https://github.com/apache/lucene/pull/14749 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apa

Re: [PR] Keep evaluating conjunction one doc-at-a-time until dynamic pruning kicks in. [lucene]

2025-06-02 Thread via GitHub
jpountz merged PR #14739: URL: https://github.com/apache/lucene/pull/14739 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apa

Re: [PR] Add code comment about use of Collections#synchronizedMap in LRUQueryCache. [lucene]

2025-06-02 Thread via GitHub
vsop-479 commented on PR #14749: URL: https://github.com/apache/lucene/pull/14749#issuecomment-2933058801 Thanks @jpountz ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? [lucene]

2025-06-02 Thread via GitHub
vsop-479 closed issue #14677: Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? URL: https://github.com/apache/lucene/issues/14677 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR

Re: [I] Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? [lucene]

2025-06-02 Thread via GitHub
vsop-479 commented on issue #14677: URL: https://github.com/apache/lucene/issues/14677#issuecomment-2933056790 Oh, You are right @benwtrent ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the s

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
gf2121 commented on code in PR #14716: URL: https://github.com/apache/lucene/pull/14716#discussion_r2121634772 ## lucene/core/src/java/org/apache/lucene/index/PostingsEnum.java: ## @@ -63,6 +87,12 @@ public abstract class PostingsEnum extends DocIdSetIterator { */ public

[PR] Add MathUtil#unsignedMin to simplify dividing the doc ID space into windows. [lucene]

2025-06-02 Thread via GitHub
jpountz opened a new pull request, #14750: URL: https://github.com/apache/lucene/pull/14750 We often need to divide the doc ID space into sub windows, but this is prone to integer overflows if the size of the window is greater than `Integer.MAX_VALUE - windowStart`. This adds `MathUtil#unsi

[PR] Add code comment about use of Collections#synchronizedMap in LRUQueryCache. [lucene]

2025-06-02 Thread via GitHub
jpountz opened a new pull request, #14749: URL: https://github.com/apache/lucene/pull/14749 It may not be totally obvious why this map needs to be synchronized since reads and writes are performed under a read-write lock. Except that unlike most other collections, reads are not thread-safe

Re: [I] Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? [lucene]

2025-06-02 Thread via GitHub
jpountz commented on issue #14677: URL: https://github.com/apache/lucene/issues/14677#issuecomment-2931141062 Argh, of-couse, thank you @benwtrent! I opened a PR so that it's more obvious to readers of this code in the future: #14749. -- This is an automated message from the Apache Git Se

Re: [PR] Merge DocAndFreqBuffer and DocAndScoreBuffer. [lucene]

2025-06-02 Thread via GitHub
romseygeek commented on code in PR #14748: URL: https://github.com/apache/lucene/pull/14748#discussion_r2121218882 ## lucene/core/src/java/org/apache/lucene/search/DocAndFloatFeatureBuffer.java: ## @@ -21,29 +21,32 @@ import org.apache.lucene.util.IntsRef; /** - * Wrapper ar

Re: [I] Can we support vectors to be loaded with direct I/O for full precision re-ranking? [lucene]

2025-06-02 Thread via GitHub
jpountz commented on issue #14746: URL: https://github.com/apache/lucene/issues/14746#issuecomment-2930378258 This was the reasoning behind the introduction of `ReadAdvice.RANDOM`. By telling the operating system that the file has a random access pattern, it should be able to optimize for t

Re: [PR] Dynamic pruning with DocValueSkipper [lucene]

2025-06-02 Thread via GitHub
martijnvg commented on code in PR #14672: URL: https://github.com/apache/lucene/pull/14672#discussion_r2121019535 ## lucene/core/src/java/org/apache/lucene/search/comparators/NumericComparator.java: ## @@ -328,120 +507,47 @@ private void updateSkipInterval(boolean success) {

Re: [I] Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? [lucene]

2025-06-02 Thread via GitHub
benwtrent commented on issue #14677: URL: https://github.com/apache/lucene/issues/14677#issuecomment-2930565960 @vsop-479 @jpountz maybe my understanding of how LinkedHashMaps work is wrong. But: https://github.com/apache/lucene/pull/13306#discussion_r1578328936 It seems to me that if

Re: [I] Does LRUQueryCache#uniqueQueries really needs wrapped by Collections.synchronizedMap? [lucene]

2025-06-02 Thread via GitHub
jpountz commented on issue #14677: URL: https://github.com/apache/lucene/issues/14677#issuecomment-2930396347 It looks like you are right, we could remove the wrapper. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

Re: [I] Potential resource leakage in WordDictionary#loadMainDataFromFile [lucene]

2025-06-02 Thread via GitHub
jpountz closed issue #14719: Potential resource leakage in WordDictionary#loadMainDataFromFile URL: https://github.com/apache/lucene/issues/14719 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] Potential resource leakage in WordDictionary#loadMainDataFromFile [lucene]

2025-06-02 Thread via GitHub
jpountz commented on issue #14719: URL: https://github.com/apache/lucene/issues/14719#issuecomment-2930382910 Fixed via #14727 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[PR] Merge DocAndFreqBuffer and DocAndScoreBuffer. [lucene]

2025-06-02 Thread via GitHub
jpountz opened a new pull request, #14748: URL: https://github.com/apache/lucene/pull/14748 I initially kept those separated but I'm now coming to the conclusion that they should be merged: - While frequencies are ints today, some queries produce float frequencies (`SloppyPhraseQuery`,

[PR] Merge CombinedFieldQuery's WeightedDisiWrapper into DisiWrapper. [lucene]

2025-06-02 Thread via GitHub
jpountz opened a new pull request, #14747: URL: https://github.com/apache/lucene/pull/14747 This aims at simplifying things a bit to make it easier to implement `CombinedFieldScorer#nextDocsAndScores` later. -- This is an automated message from the Apache Git Service. To respond to the me

Re: [PR] Merge CombinedFieldQuery's WeightedDisiWrapper into DisiWrapper. [lucene]

2025-06-02 Thread via GitHub
github-actions[bot] commented on PR #14747: URL: https://github.com/apache/lucene/pull/14747#issuecomment-2929530021 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
github-actions[bot] commented on PR #14716: URL: https://github.com/apache/lucene/pull/14716#issuecomment-2929379478 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
jpountz commented on code in PR #14716: URL: https://github.com/apache/lucene/pull/14716#discussion_r2120387948 ## lucene/core/src/java/org/apache/lucene/index/PostingsEnum.java: ## @@ -63,6 +87,12 @@ public abstract class PostingsEnum extends DocIdSetIterator { */ public

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
jpountz commented on code in PR #14716: URL: https://github.com/apache/lucene/pull/14716#discussion_r2120389716 ## lucene/core/src/java/org/apache/lucene/index/PostingsEnum.java: ## @@ -63,6 +87,12 @@ public abstract class PostingsEnum extends DocIdSetIterator { */ public

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
jpountz commented on code in PR #14716: URL: https://github.com/apache/lucene/pull/14716#discussion_r2120356527 ## lucene/core/src/java/org/apache/lucene/index/PostingsEnum.java: ## @@ -63,6 +87,12 @@ public abstract class PostingsEnum extends DocIdSetIterator { */ public

Re: [PR] Merge PostingsEnum and ImpactsEnum. [lucene]

2025-06-02 Thread via GitHub
jpountz commented on code in PR #14716: URL: https://github.com/apache/lucene/pull/14716#discussion_r2120355953 ## lucene/core/src/java/org/apache/lucene/index/TermsEnum.java: ## @@ -170,11 +170,15 @@ public final PostingsEnum postings(PostingsEnum reuse) throws IOException {