[GitHub] [lucene] twosom opened a new pull request, #12045: fix typo in KoreanNumberFilter

2022-12-29 Thread GitBox


twosom opened a new pull request, #12045:
URL: https://github.com/apache/lucene/pull/12045

   ### Description
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] agorlenko commented on pull request #11946: add similarity threshold for hnsw

2022-12-29 Thread GitBox


agorlenko commented on PR #11946:
URL: https://github.com/apache/lucene/pull/11946#issuecomment-1367395855

   Ok, it seems that I should close this PR, shouldn't I? It is not difficult 
to implement such functionality in code that uses Lucene, if it is 
necessary (in contrast to the first implementation).
   
   @msokolov what do you think?
   
   In any case, I thank you all for the discussion.





[GitHub] [lucene] uschindler commented on issue #12041: Replicator tests do not compile with Java 20

2022-12-29 Thread GitBox


uschindler commented on issue #12041:
URL: https://github.com/apache/lucene/issues/12041#issuecomment-1367520271

   Bug report opened at OpenJDK: https://bugs.openjdk.org/browse/JDK-8299416





[GitHub] [lucene] mcimadamore commented on pull request #12042: Implement MMapDirectory with Java 20 Project Panama Preview API

2022-12-29 Thread GitBox


mcimadamore commented on PR #12042:
URL: https://github.com/apache/lucene/pull/12042#issuecomment-1367601892

   Thanks for testing @uschindler!





[GitHub] [lucene] vigyasharma commented on pull request #12013: Clear thread local values on UTF8TaxonomyWriterCache.close()

2022-12-29 Thread GitBox


vigyasharma commented on PR #12013:
URL: https://github.com/apache/lucene/pull/12013#issuecomment-1367688658

   > ... but I don't understand what this "cache" is doing and why it actually 
documents that it never frees memory.
   
   My understanding is that `TaxonomyWriterCache` caches ordinals for all 
categories created in the index so far, so that categories use the same 
ordinals when facet labels are added. 
   
   It seems that we need such a cache to get ordinals for newly added 
categories from documents that are still pending flush.
   From `TaxonomyWriterCache#put()` docstring:
   ```java
    * The reason why the caller needs to know if part of the cache was cleared is that in that
    * case it will have to commit its on-disk index (so that all the latest category additions can be
    * searched on disk, if we can't rely on the cache to contain them).
   ```
   
   However, with faster BinaryDocValue fields, maybe we no longer need the 
"never evicting" `UTF8TaxonomyWriterCache`, and we could make 
`LruTaxonomyWriterCache` the default?
   
   I can close this PR and make that change if it makes sense. Or we can merge 
this change and take it up in a separate issue. Are there faceting-specific 
benchmarks that can help validate the change?
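
To make the `put()` contract quoted above concrete, here is a minimal sketch of an evicting category-to-ordinal cache. It is an illustration only, not Lucene's `LruTaxonomyWriterCache`: the class name `SketchLruOrdinalCache`, the use of plain `String` paths instead of facet labels, and the `-1` "not cached" return value are all assumptions made for this example. The point it shows is that `put()` reports whether an older entry was dropped, which is the signal the docstring says the caller needs in order to know it can no longer rely on the cache alone.

```java
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch of an LRU category-to-ordinal cache (not Lucene code). */
final class SketchLruOrdinalCache {
  private final int maxSize;
  // accessOrder = true: iteration starts at the least recently used entry.
  private final LinkedHashMap<String, Integer> ordinals =
      new LinkedHashMap<>(16, 0.75f, true);

  SketchLruOrdinalCache(int maxSize) {
    if (maxSize < 1) {
      throw new IllegalArgumentException("maxSize must be >= 1");
    }
    this.maxSize = maxSize;
  }

  /** Returns the cached ordinal, or -1 (assumed sentinel) if the path is not cached. */
  int get(String categoryPath) {
    Integer ordinal = ordinals.get(categoryPath);
    return ordinal == null ? -1 : ordinal;
  }

  /**
   * Caches the ordinal for a category path.
   *
   * @return true if an older entry had to be evicted, meaning the caller can
   *     no longer assume every previously added category is still cached
   */
  boolean put(String categoryPath, int ordinal) {
    ordinals.put(categoryPath, ordinal);
    if (ordinals.size() <= maxSize) {
      return false;
    }
    // Drop the least recently used entry (first in access order).
    Iterator<Map.Entry<String, Integer>> it = ordinals.entrySet().iterator();
    it.next();
    it.remove();
    return true;
  }
}
```

In this sketch a caller that sees `put()` return `true` would fall back to whatever durable lookup it has (per the docstring, committing the on-disk index so that recent categories become searchable there). A cache that never evicts never has to return `true`, at the cost of unbounded memory.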

