[GitHub] [lucene] SevenCss commented on issue #7820: CheckIndex cannot "fix" indexes that have individual segments with missing or corrupt .si files because sanity checks will fail trying to read the

2023-08-18 Thread via GitHub
SevenCss commented on issue #7820: URL: https://github.com/apache/lucene/issues/7820#issuecomment-1683471448 Hi , @asfimport Recently, i encountered one similar issue that there are two segments files in my index ( I'm using Lucene 8.10 version ) : - segments_a7 - segments_a8 wh

[GitHub] [lucene] mikemccand commented on issue #12513: Try out a tantivy's term dictionary format

2023-08-18 Thread via GitHub
mikemccand commented on issue #12513: URL: https://github.com/apache/lucene/issues/12513#issuecomment-1683667646 > it took me days to digest [Lucene90BlockTreeTermsWriter](https://lucene.apache.org/core/9_7_0/core/org/apache/lucene/codecs/lucene90/blocktree/Lucene90BlockTreeTermsWriter.html)

[GitHub] [lucene] almogtavor commented on issue #12406: Register nested queries (ToParentBlockJoinQuery) to Lucene Monitor

2023-08-18 Thread via GitHub
almogtavor commented on issue #12406: URL: https://github.com/apache/lucene/issues/12406#issuecomment-1683918288 @romseygeek @dweiss @uschindler @dsmiley @gsmiller I'd love to get feedback from you on the subject -- This is an automated message from the Apache Git Service. To respond to t

[GitHub] [lucene] easyice opened a new issue, #12514: Could we add more index for BKD LeafNode?

2023-08-18 Thread via GitHub
easyice opened a new issue, #12514: URL: https://github.com/apache/lucene/issues/12514 ### Description Currently in BKD LeafNode, we scan all 512 values and call `visitor.visit ` in `CELL_CROSSES_QUERY` case, this is usually not a issue for range query, but for point query, such as

[GitHub] [lucene] Tony-X commented on issue #12513: Try out a tantivy's term dictionary format

2023-08-18 Thread via GitHub
Tony-X commented on issue #12513: URL: https://github.com/apache/lucene/issues/12513#issuecomment-1684251100 Thanks @mikemccand for bringing in the context. I should've done that part better :) > FSTPostingsFormat is close? It holds all terms in a single FST (for each segment), and

[GitHub] [lucene] stefanvodita commented on a diff in pull request #12337: Index arbitrary fields in taxonomy docs

2023-08-18 Thread via GitHub
stefanvodita commented on code in PR #12337: URL: https://github.com/apache/lucene/pull/12337#discussion_r1298912817 ## lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/DirectoryTaxonomyIndexReader.java: ## @@ -0,0 +1,43 @@ +/* + * Licensed to the Apache Software