Re: [PR] [LUCENE-13044][replicator] NRT add configurable commitData for Custom… [lucene]

2024-03-01 Thread via GitHub
github-actions[bot] commented on PR #13045: URL: https://github.com/apache/lucene/pull/13045#issuecomment-1974120352 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [I] Should we explore DiskANN for aKNN vector search? [lucene]

2024-03-01 Thread via GitHub
kevindrosendahl commented on issue #12615: URL: https://github.com/apache/lucene/issues/12615#issuecomment-1974089752 Think I agree with your points @benwtrent, will just jot down my thinking on HNSW vs Vamana vs DiskANN in case it's useful. HNSW and Vamana are "competing" proximity g

[I] Openning IndexWriter with an existing commit incorrectly marks a index as changed [lucene]

2024-03-01 Thread via GitHub
Shibi-bala opened a new issue, #13151: URL: https://github.com/apache/lucene/issues/13151 ### Description Hey, I found what looks to be a bug here: https://github.com/apache/lucene/blob/releases/lucene/9.9.0/lucene/core/src/java/org/apache/lucene/index/IndexWriter.java#L1102

Re: [PR] Add Facets#getBulkSpecificValues method [lucene]

2024-03-01 Thread via GitHub
epotyom commented on PR #12862: URL: https://github.com/apache/lucene/pull/12862#issuecomment-1973648097 > [gsmiller](/gsmiller) > @epotyom I'd be happy to do a pass on this and help get it merged if you like. Would you mind resolving the current merge conflicts then I'll have

Re: [PR] Make Hnsw99Codecs Extensible [lucene]

2024-03-01 Thread via GitHub
uschindler commented on PR #13150: URL: https://github.com/apache/lucene/pull/13150#issuecomment-1973572849 Lucene prefers the delegation pattern. To prevent uncontrolled subclassing most of the components in Lucene are final. -- This is an automated message from the Apache Git Service. T

Re: [PR] Make Hnsw99Codecs Extensible [lucene]

2024-03-01 Thread via GitHub
uschindler closed pull request #13150: Make Hnsw99Codecs Extensible URL: https://github.com/apache/lucene/pull/13150 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscr

[PR] Make Hnsw99Codecs Extensible [lucene]

2024-03-01 Thread via GitHub
Shibi-bala opened a new pull request, #13150: URL: https://github.com/apache/lucene/pull/13150 ### Description Lucene99HnswVectorsFormat and Lucene99HnswScalarQuantizedVectorsFormat are not extensible. With the new changes to how max vector dimension is set in the codec, users should be

Re: [I] Try encoding very frequent terms using a dense bitmap [lucene]

2024-03-01 Thread via GitHub
msokolov commented on issue #13147: URL: https://github.com/apache/lucene/issues/13147#issuecomment-1973428361 I have some initial implementation working in BlockDocsEnum, but one thing I'm unsure about is whether to provide it in all of the PostingsEnum/ImpactsEnum specializations. I feel

Re: [PR] Add WrappedCandidateMatcher for composing matchers [lucene]

2024-03-01 Thread via GitHub
romseygeek commented on PR #13109: URL: https://github.com/apache/lucene/pull/13109#issuecomment-1973412491 The `protected` visibility on `matchQuery()` should already be fine here - you can override or call protected methods from within subclasses. I think making `reportError()` and `fini

Re: [PR] Remove ByteBufferIndexInput and update all Panama implementations (MMap and Vector) to Java 21 [lucene]

2024-03-01 Thread via GitHub
uschindler commented on PR #13146: URL: https://github.com/apache/lucene/pull/13146#issuecomment-1973230903 > I did track it down to gradle's test listeners - it looks like a bug to me. Whatever this delegation is doing, even a simple check on whether the target method is already accessible

Re: [PR] Remove ByteBufferIndexInput and update all Panama implementations (MMap and Vector) to Java 21 [lucene]

2024-03-01 Thread via GitHub
dweiss commented on PR #13146: URL: https://github.com/apache/lucene/pull/13146#issuecomment-1973141394 I did track it down to gradle's test listeners - it looks like a bug to me. Whatever this delegation is doing, even a simple check on whether the target method is already accessible (or e

Re: [PR] Do not place Panama Java 21 class files in MR-JAR section of core.jar file [lucene]

2024-03-01 Thread via GitHub
dweiss commented on PR #13148: URL: https://github.com/apache/lucene/pull/13148#issuecomment-1973086127 No worries at all. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. T

Re: [PR] Do not place Panama Java 21 class files in MR-JAR section of core.jar file [lucene]

2024-03-01 Thread via GitHub
uschindler commented on PR #13148: URL: https://github.com/apache/lucene/pull/13148#issuecomment-1972920800 Thanks for fixing the typo. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [PR] LUCENE-9869 allow for configuring a custom cache purge scheduler in Monitor (aka Luwak) [lucene]

2024-03-01 Thread via GitHub
cpoerschke commented on code in PR #99: URL: https://github.com/apache/lucene/pull/99#discussion_r1508783653 ## lucene/monitor/src/java/org/apache/lucene/monitor/Monitor.java: ## @@ -55,7 +53,7 @@ public class Monitor implements Closeable { private final long commitBatchSiz

[PR] Made DocIdsWriter use DISI when reading documents with an IntersectVisitor [lucene]

2024-03-01 Thread via GitHub
antonha opened a new pull request, #13149: URL: https://github.com/apache/lucene/pull/13149 Instead of calling `IntersectVisitor.visit` for each doc in the `readDelta16` and `readInts32` methods, create a `DocIdSetIterator` and call `IntersectVisitor.visit(DocIdSetIterator)` instead.

Re: [PR] Terminate automaton after matched the whole prefix for PrefixQuery. [lucene]

2024-03-01 Thread via GitHub
vsop-479 commented on PR #13072: URL: https://github.com/apache/lucene/pull/13072#issuecomment-1972857051 There is still problem in the implementation of detecting whether accept state can match all suffix (`RunAutomaton.canMatchAllSuffix`), I am trying to figure it out. On the other

Re: [PR] Remove ByteBufferIndexInput and update all Panama implementations (MMap and Vector) to Java 21 [lucene]

2024-03-01 Thread via GitHub
uschindler commented on code in PR #13146: URL: https://github.com/apache/lucene/pull/13146#discussion_r1508734711 ## gradle/testing/randomization/policies/tests.policy: ## @@ -50,9 +50,6 @@ grant { permission java.lang.RuntimePermission "getStackTrace"; // needed for mock

Re: [PR] Add WrappedCandidateMatcher for composing matchers [lucene]

2024-03-01 Thread via GitHub
cpoerschke commented on PR #13109: URL: https://github.com/apache/lucene/pull/13109#issuecomment-1972848845 Thanks @bjacobowitz for a very detailed pull request here! I'm not really familiar with this area of the code but intuitively would be curious what the alternative (you mentione

Re: [PR] Remove ByteBufferIndexInput and update all Panama implementations (MMap and Vector) to Java 21 [lucene]

2024-03-01 Thread via GitHub
ChrisHegarty commented on code in PR #13146: URL: https://github.com/apache/lucene/pull/13146#discussion_r1508716978 ## gradle/testing/randomization/policies/tests.policy: ## @@ -50,9 +50,6 @@ grant { permission java.lang.RuntimePermission "getStackTrace"; // needed for mo

Re: [PR] Remove ByteBufferIndexInput and update all Panama implementations (MMap and Vector) to Java 21 [lucene]

2024-03-01 Thread via GitHub
uschindler commented on PR #13146: URL: https://github.com/apache/lucene/pull/13146#issuecomment-1972777086 > I'm not sure how you debugged that! I did not debug that; it was an observation and then try/error. I was a bit annoyed yesterday so here is my observations: - Jenkins