[jira] [Created] (LUCENE-10680) UnifiedHighlighter's term extraction not working for some query rewrites

2022-08-17 Thread Yannick Welsch (Jira)
Yannick Welsch created LUCENE-10680: --- Summary: UnifiedHighlighter's term extraction not working for some query rewrites Key: LUCENE-10680 URL: https://issues.apache.org/jira/browse/LUCENE-10680 Proj

[jira] [Commented] (LUCENE-10680) UnifiedHighlighter's term extraction not working for some query rewrites

2022-08-17 Thread Alan Woodward (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580735#comment-17580735 ] Alan Woodward commented on LUCENE-10680: I think the `rewrite` call here is act

[jira] [Commented] (LUCENE-10680) UnifiedHighlighter's term extraction not working for some query rewrites

2022-08-17 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580874#comment-17580874 ] Julie Tibshirani commented on LUCENE-10680: --- Thanks for debugging this [~ywel

[jira] [Created] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
Luís Filipe Nassif created LUCENE-10681: --- Summary: ArrayIndexOutOfBoundsException while indexing large binary file Key: LUCENE-10681 URL: https://issues.apache.org/jira/browse/LUCENE-10681 Proje

[GitHub] [lucene] gsmiller commented on a diff in pull request #1013: LUCENE-10644: Facets#getAllChildren testing should ignore child order

2022-08-17 Thread GitBox
gsmiller commented on code in PR #1013: URL: https://github.com/apache/lucene/pull/1013#discussion_r948197598 ## lucene/facet/src/test/org/apache/lucene/facet/range/TestRangeFacetCounts.java: ## @@ -455,9 +500,9 @@ public void testEmptyRangesMultiValued() throws Exception {

[jira] [Updated] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
[ https://issues.apache.org/jira/browse/LUCENE-10681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated LUCENE-10681: Description: Hello, I looked for a similar issue, but didn't find one, so I'm cr

[jira] [Updated] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
[ https://issues.apache.org/jira/browse/LUCENE-10681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated LUCENE-10681: Description: Hello, I looked for a similar issue, but didn't find one, so I'm cr

[GitHub] [lucene] gsmiller commented on a diff in pull request #1058: LUCENE-10207: TermInSetQuery now provides a ScoreSupplier with cost estimation for use in TermInSetQuery

2022-08-17 Thread GitBox
gsmiller commented on code in PR #1058: URL: https://github.com/apache/lucene/pull/1058#discussion_r948265844 ## lucene/core/src/java/org/apache/lucene/search/TermInSetQuery.java: ## @@ -345,15 +345,62 @@ public BulkScorer bulkScorer(LeafReaderContext context) throws IOExceptio

[jira] [Commented] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Jack Mazanec (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580935#comment-17580935 ] Jack Mazanec commented on LUCENE-10318: --- Hi [~julietibs]  I was thinking about so

[GitHub] [lucene] gsmiller commented on a diff in pull request #1062: Optimize TermInSetQuery for terms that match all docs in a segment

2022-08-17 Thread GitBox
gsmiller commented on code in PR #1062: URL: https://github.com/apache/lucene/pull/1062#discussion_r948336077 ## lucene/core/src/java/org/apache/lucene/search/TermInSetQuery.java: ## @@ -363,6 +370,29 @@ public boolean isCacheable(LeafReaderContext ctx) { // sets.

[jira] [Updated] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
[ https://issues.apache.org/jira/browse/LUCENE-10681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated LUCENE-10681: Environment: Ubuntu 20.04 (LTS), java x64 version 11.0.16.1 (was: Linux Ubuntu (

[jira] [Updated] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
[ https://issues.apache.org/jira/browse/LUCENE-10681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated LUCENE-10681: Priority: Major (was: Minor) > ArrayIndexOutOfBoundsException while indexing lar

[jira] [Commented] (LUCENE-10681) ArrayIndexOutOfBoundsException while indexing large binary file

2022-08-17 Thread Jira
[ https://issues.apache.org/jira/browse/LUCENE-10681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580973#comment-17580973 ] Luís Filipe Nassif commented on LUCENE-10681: - Just changed the priority to

[jira] [Commented] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580979#comment-17580979 ] Mayya Sharipova commented on LUCENE-10318: -- Thanks for looking into this, Jack

[jira] [Comment Edited] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580979#comment-17580979 ] Mayya Sharipova edited comment on LUCENE-10318 at 8/17/22 8:01 PM: --

[jira] [Comment Edited] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17580979#comment-17580979 ] Mayya Sharipova edited comment on LUCENE-10318 at 8/17/22 8:02 PM: --

[jira] [Commented] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17581016#comment-17581016 ] Julie Tibshirani commented on LUCENE-10318: --- [~jmazanec15] it's great you're

[jira] [Comment Edited] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17581016#comment-17581016 ] Julie Tibshirani edited comment on LUCENE-10318 at 8/17/22 8:51 PM: -

[GitHub] [lucene] Yuti-G commented on a diff in pull request #1013: LUCENE-10644: Facets#getAllChildren testing should ignore child order

2022-08-17 Thread GitBox
Yuti-G commented on code in PR #1013: URL: https://github.com/apache/lucene/pull/1013#discussion_r948407058 ## lucene/facet/src/test/org/apache/lucene/facet/range/TestRangeFacetCounts.java: ## @@ -100,12 +100,21 @@ public void testBasicLong() throws Exception { new

[GitHub] [lucene] jtibshirani opened a new pull request, #1071: LUCENE-9583: Remove RandomAccessVectorValuesProducer

2022-08-17 Thread GitBox
jtibshirani opened a new pull request, #1071: URL: https://github.com/apache/lucene/pull/1071 This change folds the `RandomAccessVectorValuesProducer` interface into `RandomAccessVectorValues`. This reduces the number of interfaces and clarifies the cloning/ copying behavior. Th

[GitHub] [lucene] jtibshirani commented on a diff in pull request #1071: LUCENE-9583: Remove RandomAccessVectorValuesProducer

2022-08-17 Thread GitBox
jtibshirani commented on code in PR #1071: URL: https://github.com/apache/lucene/pull/1071#discussion_r948528112 ## lucene/core/src/test/org/apache/lucene/util/hnsw/KnnGraphTester.java: ## @@ -783,66 +742,6 @@ private static void usage() { System.exit(1); } - class Bi

[jira] [Updated] (LUCENE-10318) Reuse HNSW graphs when merging segments?

2022-08-17 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julie Tibshirani updated LUCENE-10318: -- Labels: vector-based-search (was: ) > Reuse HNSW graphs when merging segments? > ---

[GitHub] [lucene] gsmiller commented on a diff in pull request #1013: LUCENE-10644: Facets#getAllChildren testing should ignore child order

2022-08-17 Thread GitBox
gsmiller commented on code in PR #1013: URL: https://github.com/apache/lucene/pull/1013#discussion_r948541210 ## lucene/facet/src/test/org/apache/lucene/facet/range/TestRangeFacetCounts.java: ## @@ -100,12 +100,21 @@ public void testBasicLong() throws Exception { ne

[GitHub] [lucene] Yuti-G commented on a diff in pull request #1013: LUCENE-10644: Facets#getAllChildren testing should ignore child order

2022-08-17 Thread GitBox
Yuti-G commented on code in PR #1013: URL: https://github.com/apache/lucene/pull/1013#discussion_r948557786 ## lucene/facet/src/test/org/apache/lucene/facet/range/TestRangeFacetCounts.java: ## @@ -100,12 +100,21 @@ public void testBasicLong() throws Exception { new

[GitHub] [lucene] jtibshirani commented on a diff in pull request #1054: LUCENE-10577: enable quantization of HNSW vectors to 8 bits

2022-08-17 Thread GitBox
jtibshirani commented on code in PR #1054: URL: https://github.com/apache/lucene/pull/1054#discussion_r948548244 ## lucene/core/src/java/org/apache/lucene/search/KnnVectorQuery.java: ## @@ -133,22 +130,21 @@ private TopDocs searchLeaf(LeafReaderContext ctx, Weight filterWeight)