[GitHub] [lucene] benwtrent commented on pull request #12064: Create new KnnByteVectorField and KnnVectorsReader#getByteVectorValues(String)

2023-01-09 Thread GitBox
benwtrent commented on PR #12064: URL: https://github.com/apache/lucene/pull/12064#issuecomment-1375565264 > E.g. could hamming distance reuse the byte[] API by introducing a new distance function and half-float/bfloat16 reuse the float[] API? Hamming distance for binary vectors will

[GitHub] [lucene] vstrout commented on issue #12067: Getting exception on search after upgrading to Lucene 9.4

2023-01-09 Thread GitBox
vstrout commented on issue #12067: URL: https://github.com/apache/lucene/issues/12067#issuecomment-1375885952 Do you mind to point out what will be a proper solution then for case-insensitive sorting? The code from my example was working for us with all versions of Lucene prior to 9.4. Appr

[GitHub] [lucene] jpountz opened a new issue, #12068: Is it right to throttle the creation of compound files?

2023-01-09 Thread GitBox
jpountz opened a new issue, #12068: URL: https://github.com/apache/lucene/issues/12068 ### Description Our current merge throttling implementation also covers the creation of the compound file when `MergePolicy#useCompoundFile` returns true. But I don't think it is right and I suspec

[GitHub] [lucene] benwtrent opened a new issue, #12069: Long rewrite times for deeply nested, non-scoring Boolean queries

2023-01-09 Thread GitBox
benwtrent opened a new issue, #12069: URL: https://github.com/apache/lucene/issues/12069 ### Description When there is deeply nested Boolean clauses, the rewrite time can be exceptionally long in the non-scoring case. On my machine, this test takes 3+ seconds to run. I suspect

[GitHub] [lucene] benwtrent commented on issue #12069: Long rewrite times for deeply nested, non-scoring Boolean queries

2023-01-09 Thread GitBox
benwtrent commented on issue #12069: URL: https://github.com/apache/lucene/issues/12069#issuecomment-1376348361 @jpountz You might be interested in this. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [lucene] msokolov commented on issue #11354: Reuse HNSW graphs when merging segments? [LUCENE-10318]

2023-01-09 Thread GitBox
msokolov commented on issue #11354: URL: https://github.com/apache/lucene/issues/11354#issuecomment-1376350472 yeah, thanks that seems to have reduced the noise some. Probably what remains is down to GC, system hiccups, etc; it's inevitable to see some variance. -- This is an automated m