[GitHub] [lucene] PenghaiZhang commented on issue #7362: Deprecate Filter [LUCENE-6301]

2023-08-29 Thread via GitHub
PenghaiZhang commented on issue #7362: URL: https://github.com/apache/lucene/issues/7362#issuecomment-1696914036 Is there any example about how to use Query instead of Filter, especially around the use of method `getDocIdSet` ? -- This is an automated message from the Apache Git Service.

[GitHub] [lucene] thomasdc opened a new pull request, #12525: Let WordDelimiterGraphFilterFactory propagate ignoreKeywords flag

2023-08-29 Thread via GitHub
thomasdc opened a new pull request, #12525: URL: https://github.com/apache/lucene/pull/12525 `WordDelimiterGraphFilter` already contains support for the `IGNORE_KEYWORDS` flag. This commits enables propagating that flag from the `WordDelimiterGraphFilterFactory`. Fixes https://github

[GitHub] [lucene] benwtrent merged pull request #12518: Use panama vector for l2normalize

2023-08-29 Thread via GitHub
benwtrent merged PR #12518: URL: https://github.com/apache/lucene/pull/12518 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.a

[GitHub] [lucene] mkhludnev commented on issue #7362: Deprecate Filter [LUCENE-6301]

2023-08-29 Thread via GitHub
mkhludnev commented on issue #7362: URL: https://github.com/apache/lucene/issues/7362#issuecomment-1697367875 > Is there any example about how to use Query instead of Filter, especially around the use of method `getDocIdSet` ? Hi, @PenghaiZhang . DocIdSet just let to iterate doc i

[GitHub] [lucene] mikemccand commented on issue #7820: CheckIndex cannot "fix" indexes that have individual segments with missing or corrupt .si files because sanity checks will fail trying to read th

2023-08-29 Thread via GitHub
mikemccand commented on issue #7820: URL: https://github.com/apache/lucene/issues/7820#issuecomment-1697605932 Thanks @SevenCss. We may never know what got your index into this state, unless it happens again. Let's focus this issue on improving `CheckIndex` to detect and correct (wit

[GitHub] [lucene] jpountz commented on pull request #12526: Speed up disjunctions by computing estimations of the score of the k-th top hit up-front.

2023-08-29 Thread via GitHub
jpountz commented on PR #12526: URL: https://github.com/apache/lucene/pull/12526#issuecomment-1698177593 Here are results on `wikimedium10m`. `OrHighHigh` and `OrHighMed` don't get a speedup because their minimum competitive scores compute pretty quickly anyway, but `OrHighHigh` sees a majo

[GitHub] [lucene] jpountz opened a new pull request, #12526: Speed up disjunctions by computing estimations of the score of the k-th top hit up-front.

2023-08-29 Thread via GitHub
jpountz opened a new pull request, #12526: URL: https://github.com/apache/lucene/pull/12526 Currently, our dynamic pruning logic for disjunctions updates the minimum competitive score as it sees more and more competitive hits. However this process can take time if some of the high-scoring c

[GitHub] [lucene] jpountz commented on pull request #12489: Add support for recursive graph bisection.

2023-08-29 Thread via GitHub
jpountz commented on PR #12489: URL: https://github.com/apache/lucene/pull/12489#issuecomment-1698178694 > So it takes much longer for dynamic pruning to start helping significantly. I suspect we have room for improvement to better deal with this sort of scenario. I opened https://gi

[GitHub] [lucene] PenghaiZhang commented on issue #7362: Deprecate Filter [LUCENE-6301]

2023-08-29 Thread via GitHub
PenghaiZhang commented on issue #7362: URL: https://github.com/apache/lucene/issues/7362#issuecomment-1698275146 > Hi @mkhludnev thanks for you quick response!! Actually my product had been using Lucene v3.6.2 for over a decade. I managed to upgrade it to v4 on last week, and now I

[GitHub] [lucene] PenghaiZhang commented on issue #8600: Update Lucene from 3.1.0 to 6.3.0 missing lot of library [LUCENE-7549]

2023-08-29 Thread via GitHub
PenghaiZhang commented on issue #8600: URL: https://github.com/apache/lucene/issues/8600#issuecomment-1698315841 I have checked the change log and migration guide for v5.5.5, but I cannot find any info about `FieldCacheRangeFilter`. -- This is an automated message from the Apache Git Ser

[GitHub] [lucene] jainankitk opened a new issue, #12527: Optimize readInts24 performance for DocIdsWriter

2023-08-29 Thread via GitHub
jainankitk opened a new issue, #12527: URL: https://github.com/apache/lucene/issues/12527 ### Description While recently [working on numeric range queries](https://github.com/opensearch-project/OpenSearch/issues/9541), I noticed readInts24 to be consuming significant CPU cycles. When

[GitHub] [lucene] PenghaiZhang commented on issue #7350: FieldCacheRangeFilter missing from MIGRATE.html [LUCENE-6288]

2023-08-29 Thread via GitHub
PenghaiZhang commented on issue #7350: URL: https://github.com/apache/lucene/issues/7350#issuecomment-1698334404 Same question - this one is neither mentioned in the migration guide nor the change log list. Can someone please give some advice ? -- This is an automated message from the Ap