[PR] Fix resource leak in loadMainDataFromFile [lucene]

2025-05-28 Thread via GitHub
xcx1r3 opened a new pull request, #14727: URL: https://github.com/apache/lucene/pull/14727 ### Description Use try-with-resources to auto-close DataInputStream ``` try (DataInputStream dctFile = new DataInputStream(Files.newInputStream(Paths.get(dctFilePath { ... }

Re: [PR] Fix resource leak in loadMainDataFromFile [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14727: URL: https://github.com/apache/lucene/pull/14727#issuecomment-2915210212 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Add a DoubleValuesSource for scoring full precision vector similarity [lucene]

2025-05-28 Thread via GitHub
vigyasharma commented on PR #14708: URL: https://github.com/apache/lucene/pull/14708#issuecomment-2917306575 Thanks for the review folks! I like the idea of a separate class and a custom vector comparator, will make these changes. -- This is an automated message from the Apache Git Servic

Re: [PR] Add a DoubleValuesSource for scoring full precision vector similarity [lucene]

2025-05-28 Thread via GitHub
vigyasharma commented on PR #14708: URL: https://github.com/apache/lucene/pull/14708#issuecomment-2917325435 I'm not sure about the byte vector case myself. Do we see a viable need for it in FullPrecisionVectorSimilaritySource ? -- This is an automated message from the Apache Git Service

Re: [PR] Add a DoubleValuesSource for scoring full precision vector similarity [lucene]

2025-05-28 Thread via GitHub
benwtrent commented on PR #14708: URL: https://github.com/apache/lucene/pull/14708#issuecomment-2917328559 > I'm not sure about the byte vector case myself. Do we see a viable need for it in FullPrecisionVectorSimilaritySource ? I am not sure. As of right now, none of the quantization

Re: [PR] Fix resource leak in loadMainDataFromFile [lucene]

2025-05-28 Thread via GitHub
jpountz commented on PR #14727: URL: https://github.com/apache/lucene/pull/14727#issuecomment-2916354220 Looks good! Can you add an entry in lucene/CHANGES.txt under version 10.3? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

Re: [PR] Only run the labeller on the main branch of the lucene repository [lucene]

2025-05-28 Thread via GitHub
dweiss commented on PR #14721: URL: https://github.com/apache/lucene/pull/14721#issuecomment-2916470586 @pseudo-nymous - if you can shed some light on this before I merge, it'd be great. I'll wait a bit for your feedback. -- This is an automated message from the Apache Git Service. To res

Re: [PR] Only run the labeller on the main branch of the lucene repository [lucene]

2025-05-28 Thread via GitHub
dweiss commented on PR #14721: URL: https://github.com/apache/lucene/pull/14721#issuecomment-2916469412 I've no idea. All I know is I get failures when I work on a self-fork PR, see here - https://github.com/dweiss/lucene/actions/workflows/label-pull-request.yml these recent "skipp

Re: [PR] deps(java): bump org.apache.groovy:groovy-all from 4.0.26 to 4.0.27 [lucene]

2025-05-28 Thread via GitHub
dweiss merged PR #14722: URL: https://github.com/apache/lucene/pull/14722 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apac

Re: [PR] deps(java): bump com.diffplug.spotless from 7.0.3 to 7.0.4 [lucene]

2025-05-28 Thread via GitHub
dweiss merged PR #14723: URL: https://github.com/apache/lucene/pull/14723 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apac

Re: [PR] Add a DoubleValuesSource for scoring full precision vector similarity [lucene]

2025-05-28 Thread via GitHub
msokolov commented on PR #14708: URL: https://github.com/apache/lucene/pull/14708#issuecomment-2916875650 +1 to add support for full precision re-ranking. Have you considered writing a FullPrecisionVectorSimilaritySource as a separate class? We like to avoid conditional logic on boolean pa

Re: [PR] Add a DoubleValuesSource for scoring full precision vector similarity [lucene]

2025-05-28 Thread via GitHub
benwtrent commented on PR #14708: URL: https://github.com/apache/lucene/pull/14708#issuecomment-2916888768 > Have you considered writing a FullPrecisionVectorSimilaritySource as a separate class? A separate class would allow users to provide a custom vector comparator, which might be

[PR] [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… [lucene]

2025-05-28 Thread via GitHub
Jinny-Wang opened a new pull request, #14728: URL: https://github.com/apache/lucene/pull/14728 #14565 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubs

Re: [PR] [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14728: URL: https://github.com/apache/lucene/pull/14728#issuecomment-2917468073 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… [lucene]

2025-05-28 Thread via GitHub
Jinny-Wang closed pull request #14728: [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… URL: https://github.com/apache/lucene/pull/14728 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

Re: [PR] [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14728: URL: https://github.com/apache/lucene/pull/14728#issuecomment-2917477162 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Reduce NeighborArray heap memory [lucene]

2025-05-28 Thread via GitHub
jainankitk commented on PR #14527: URL: https://github.com/apache/lucene/pull/14527#issuecomment-2917542707 Thanks @benwtrent and @weizijun for seeing this through! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the U

Re: [PR] Fix Method declared 'final' in 'final' class in LongHeap. [lucene]

2025-05-28 Thread via GitHub
msokolov merged PR #14712: URL: https://github.com/apache/lucene/pull/14712 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.ap

Re: [PR] Optimize AbstractKnnVectorQuery#createBitSet with intoBitset [lucene]

2025-05-28 Thread via GitHub
msokolov commented on code in PR #14674: URL: https://github.com/apache/lucene/pull/14674#discussion_r2112296471 ## lucene/core/src/java/org/apache/lucene/search/AbstractKnnVectorQuery.java: ## @@ -226,15 +227,25 @@ private BitSet createBitSet(DocIdSetIterator iterator, Bits li

Re: [PR] Optimize AbstractKnnVectorQuery#createBitSet with intoBitset [lucene]

2025-05-28 Thread via GitHub
msokolov commented on code in PR #14674: URL: https://github.com/apache/lucene/pull/14674#discussion_r2112296471 ## lucene/core/src/java/org/apache/lucene/search/AbstractKnnVectorQuery.java: ## @@ -226,15 +227,25 @@ private BitSet createBitSet(DocIdSetIterator iterator, Bits li

Re: [PR] Move HitQueue in TopScoreDocCollector to a LongHeap [lucene]

2025-05-28 Thread via GitHub
jpountz commented on code in PR #14714: URL: https://github.com/apache/lucene/pull/14714#discussion_r2111945886 ## lucene/core/src/java/org/apache/lucene/search/TopScoreDocCollector.java: ## @@ -73,23 +65,22 @@ public ScoreMode scoreMode() { public LeafCollector getLeafCollec

Re: [PR] Only run the labeller on the main branch of the lucene repository [lucene]

2025-05-28 Thread via GitHub
dweiss merged PR #14721: URL: https://github.com/apache/lucene/pull/14721 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apac

Re: [PR] Only run the labeller on the main branch of the lucene repository [lucene]

2025-05-28 Thread via GitHub
pseudo-nymous commented on PR #14721: URL: https://github.com/apache/lucene/pull/14721#issuecomment-2916739701 I'm also not sure about the permission issue. There have been past successful runs in forks. [Documentation](https://docs.github.com/en/rest/issues/labels?apiVersion=2022-11

Re: [PR] [BlockJoin] Add ParentsChildrenBlockJoinQuery to support parent and c… [lucene]

2025-05-28 Thread via GitHub
msfroh commented on code in PR #14728: URL: https://github.com/apache/lucene/pull/14728#discussion_r2112783861 ## lucene/join/src/test/org/apache/lucene/search/join/TestParentsChildrenBlockJoinQuery.java: ## @@ -0,0 +1,186 @@ +/* + * Licensed to the Apache Software Foundation (A

Re: [PR] Avoid unnecessary comparison for CELL_CROSSES_QUERY cases [lucene]

2025-05-28 Thread via GitHub
jainankitk merged PR #14626: URL: https://github.com/apache/lucene/pull/14626 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.

[PR] Support for Re-Ranking Queries using Late Interaction Model Multi-Vectors. [lucene]

2025-05-28 Thread via GitHub
vigyasharma opened a new pull request, #14729: URL: https://github.com/apache/lucene/pull/14729 Late Interaction models, like [ColBERT](https://arxiv.org/abs/2004.12832) and [ColPali](https://arxiv.org/html/2407.01449v2), capture rich semantic interaction between documents and queries, and

Re: [PR] Support for Re-Ranking Queries using Late Interaction Model Multi-Vectors. [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14729: URL: https://github.com/apache/lucene/pull/14729#issuecomment-2917887297 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Support for Re-Ranking Queries using Late Interaction Model Multi-Vectors. [lucene]

2025-05-28 Thread via GitHub
vigyasharma commented on PR #14729: URL: https://github.com/apache/lucene/pull/14729#issuecomment-2917890456 This change builds on the work shared [here](https://github.com/apache/lucene/pull/13525#issuecomment-2445295372) by @jimczi, thanks Jim! -- This is an automated message from the

Re: [PR] Fix documentation regarding benchmark running [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14667: URL: https://github.com/apache/lucene/pull/14667#issuecomment-2917902454 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Updating skip-changelog label [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14661: URL: https://github.com/apache/lucene/pull/14661#issuecomment-2917902497 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] No ruff violation [lucene]

2025-05-28 Thread via GitHub
github-actions[bot] commented on PR #14725: URL: https://github.com/apache/lucene/pull/14725#issuecomment-2917935696 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you wil

Re: [PR] Fix documentation regarding benchmark running [lucene]

2025-05-28 Thread via GitHub
viliam-durina commented on PR #14667: URL: https://github.com/apache/lucene/pull/14667#issuecomment-2918487383 @jainankitk Do you really want changelogs for this kind of changes? Nevertheless, I added it. -- This is an automated message from the Apache Git Service. To respond to the messa

Re: [PR] Fix resource leak in loadMainDataFromFile [lucene]

2025-05-28 Thread via GitHub
xcx1r3 commented on PR #14727: URL: https://github.com/apache/lucene/pull/14727#issuecomment-2918148345 It seems there were a couple more areas got same problem and I've added a few more commits to this PR which are ``` 1、org.apache.lucene.analysis.cn.smart.hhmm.BigramDictionary#loadF