[GitHub] [lucene] vsop-479 opened a new pull request, #12495: Fix comment on decode method in PForUtil

2023-08-08 Thread via GitHub
vsop-479 opened a new pull request, #12495: URL: https://github.com/apache/lucene/pull/12495 Fix comment on decode method from: 'Decode 128 integers into ints' to 'Decode 128 integers into longs' (same as ForUtil). -- This is an automated message from the Apache Git Service. To respond t

[GitHub] [lucene] zf853109035 commented on issue #12496: Cannot find implementing class for:org.apache.lucene.analysis.tokenattributes.PayloadAttribute

2023-08-08 Thread via GitHub
zf853109035 commented on issue #12496: URL: https://github.com/apache/lucene/issues/12496#issuecomment-1669506742 lucene version is:8.11.2 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spec

[GitHub] [lucene] benwtrent merged pull request #12493: Fix test failure with zero-length vector

2023-08-08 Thread via GitHub
benwtrent merged PR #12493: URL: https://github.com/apache/lucene/pull/12493 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.a

[GitHub] [lucene] jmazanec15 commented on issue #12342: Prevent VectorSimilarity.DOT_PRODUCT from returning negative scores

2023-08-08 Thread via GitHub
jmazanec15 commented on issue #12342: URL: https://github.com/apache/lucene/issues/12342#issuecomment-1669857468 @searchivarius I see, yes thats correct. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [lucene] benwtrent opened a new issue, #12497: Add Scalar Quantization codec for Vectors

2023-08-08 Thread via GitHub
benwtrent opened a new issue, #12497: URL: https://github.com/apache/lucene/issues/12497 ### Description Having copy-on-write segments lends itself nicely with quantization. I propose we add a new "scalar" or "linear" quantization codec. This will be a simple quantization codec provi

[GitHub] [lucene] donnerpeter merged pull request #12491: hunspell: simplify TrigramAutomaton to speed up the suggestion enumeration

2023-08-08 Thread via GitHub
donnerpeter merged PR #12491: URL: https://github.com/apache/lucene/pull/12491 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene

[GitHub] [lucene] jimczi commented on issue #12497: Add Scalar Quantization codec for Vectors

2023-08-08 Thread via GitHub
jimczi commented on issue #12497: URL: https://github.com/apache/lucene/issues/12497#issuecomment-1670739188 > I am not sure we can create the HNSW graph until all vectors are quantized. Some experimentation will have to be done here. It may be that creating the graph in a streaming fashion