Re: [PR] Add a Better Binary Quantizer format for dense vectors [lucene]

2024-12-13 Thread via GitHub
gaoj0017 commented on PR #13651: URL: https://github.com/apache/lucene/pull/13651#issuecomment-2542789836 Thanks, Tanya @tanyaroosta , for sharing our blog about RaBitQ in this thread. I am the first author of the [RaBitQ paper](https://arxiv.org/abs/2405.12497). I am glad to know that our

Re: [PR] Add Query for reranking KnnFloatVectorQuery with full-precision vectors [lucene]

2024-12-13 Thread via GitHub
github-actions[bot] commented on PR #14009: URL: https://github.com/apache/lucene/pull/14009#issuecomment-2542583957 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

[I] Missing word on Brazillian stop word list [lucene]

2024-12-13 Thread via GitHub
eusousu opened a new issue, #14065: URL: https://github.com/apache/lucene/issues/14065 ### Description In Brazilian Portuguese the conjunction "em (preposition) + (article)" takes the forms "na, nas, no, nos", which are common stop words. On the Brazilian [stopwords.txt](https://githu

Re: [I] releaseWizard.py DeprecationWarning: datetime.datetime.utcnow() is deprecated [lucene]

2024-12-13 Thread via GitHub
rmuir commented on issue #14063: URL: https://github.com/apache/lucene/issues/14063#issuecomment-2541556163 change to `datetime.now(tz=UTC)` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp
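The change suggested in this comment can be sketched as follows. This is a minimal illustration, not the actual patch to releaseWizard.py: the helper name `utc_now` is hypothetical, and `timezone.utc` is used here instead of the `UTC` alias because `datetime.UTC` only exists on Python 3.11 and later.

```python
from datetime import datetime, timezone


def utc_now():
    """Return the current time as a timezone-aware datetime in UTC.

    Hypothetical helper for illustration; it replaces the deprecated,
    naive datetime.utcnow() with an aware equivalent.
    """
    # timezone.utc works on every Python 3 release; on 3.11+ the
    # alias datetime.UTC refers to the same object.
    return datetime.now(tz=timezone.utc)


if __name__ == "__main__":
    now = utc_now()
    # Unlike utcnow(), the result carries tzinfo, so isoformat()
    # includes the "+00:00" offset.
    print(now.isoformat())
```

Note that the returned value is timezone-aware, so any code that compares it with naive datetimes or formats it without an offset may need adjusting.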

Re: [I] Add 9.12.1 Index version constant and backward compat tests [lucene]

2024-12-13 Thread via GitHub
ChrisHegarty commented on issue #14060: URL: https://github.com/apache/lucene/issues/14060#issuecomment-2541532695 BWC tests for 9.12.1 - https://github.com/apache/lucene/commit/41858faa90d8753a6e97d6f667e221fc1ecaacdf

Re: [I] Add 9.12.1 Index version constant and backward compat tests [lucene]

2024-12-13 Thread via GitHub
ChrisHegarty closed issue #14060: Add 9.12.1 Index version constant and backward compat tests URL: https://github.com/apache/lucene/issues/14060

Re: [PR] Allow doc ID set iterators and scorers to optimize bulk iteration / scoring. [lucene]

2024-12-13 Thread via GitHub
jpountz commented on PR #14064: URL: https://github.com/apache/lucene/pull/14064#issuecomment-2541411484 Here are results on wikibigall for exhaustive evaluation (totalHitsThreshold = Integer.MAX_VALUE) on both the baseline and the candidate: ``` TaskQPS

Re: [PR] Allow doc ID set iterators and scorers to optimize bulk iteration / scoring. [lucene]

2024-12-13 Thread via GitHub
jpountz commented on PR #14064: URL: https://github.com/apache/lucene/pull/14064#issuecomment-2541412414 I made this PR a draft while I'm still thinking of whether there is a better way of getting the same benefits.

[PR] Allow doc ID set iterators and scorers to optimize bulk iteration / scoring. [lucene]

2024-12-13 Thread via GitHub
jpountz opened a new pull request, #14064: URL: https://github.com/apache/lucene/pull/14064 While trying to understand why Tantivy is still much faster than Lucene at exhaustive evaluation of disjunctions, I noticed that there is significant potential for vectorization that Lucene doesn't

[PR] Check uptTo before score in ReqExclBulkScorer#score. [lucene]

2024-12-13 Thread via GitHub
vsop-479 opened a new pull request, #14062: URL: https://github.com/apache/lucene/pull/14062 ### Description