[GitHub] [lucene] mikemccand commented on issue #12185: Using DirectIODirectory results in BufferOverflowException

2023-03-23 Thread via GitHub
mikemccand commented on issue #12185: URL: https://github.com/apache/lucene/issues/12185#issuecomment-1481241581 > As an aside, in some standard benchmark tests I run with our product, I have found the final optimisation of Lucene indexes after all the data has been indexed took 36 seconds

[GitHub] [lucene] mikemccand commented on issue #12185: Using DirectIODirectory results in BufferOverflowException

2023-03-23 Thread via GitHub
mikemccand commented on issue #12185: URL: https://github.com/apache/lucene/issues/12185#issuecomment-1481243567 > Using the standard Directory implementations of Lucene work fine on this NFS filesystem, so perhaps there is a corner case here which is not handled? Ahh this is also int

[GitHub] [lucene] fthevenet opened a new issue, #12211: Searches made via DrillSideways may miss documents that should match the query.

2023-03-23 Thread via GitHub
fthevenet opened a new issue, #12211: URL: https://github.com/apache/lucene/issues/12211 ### Description Hi, I use `DrillSideways` quite heavily in a project of mine and it recently realized that sometimes some documents that *should* match a query do not, whenever at least on

[GitHub] [lucene] fthevenet opened a new pull request, #12212: Fixes Searches made via DrillSideways may miss documents that should match the query

2023-03-23 Thread via GitHub
fthevenet opened a new pull request, #12212: URL: https://github.com/apache/lucene/pull/12212 This PR aims to address issue #12211: Searches made via DrillSideways may miss documents that should match the query. A more detailed explanation of the issue and the reasoning behind the fix

[GitHub] [lucene] benwtrent commented on pull request #12191: Increase KnnByteVectorField limit on dimensions to 2048

2023-03-23 Thread via GitHub
benwtrent commented on PR #12191: URL: https://github.com/apache/lucene/pull/12191#issuecomment-1481641849 > So I'm still challenging the use-case for this. "because users ask for it" is not enough for me, given the maturity level of these technologies. @rmuir Ah, I thought your main

[GitHub] [lucene] gsmiller commented on issue #12211: Searches made via DrillSideways may miss documents that should match the query.

2023-03-23 Thread via GitHub
gsmiller commented on issue #12211: URL: https://github.com/apache/lucene/issues/12211#issuecomment-1481642864 Great find, and great bug report! Thank you so much! Having a look at the PR now, but it seems reasonable to me. -- This is an automated message from the Apache Git Service. To r

[GitHub] [lucene] gsmiller commented on pull request #12212: Fixes Searches made via DrillSideways may miss documents that should match the query

2023-03-23 Thread via GitHub
gsmiller commented on PR #12212: URL: https://github.com/apache/lucene/pull/12212#issuecomment-1481732555 This looks great! Thanks @fthevenet. How hard would it be to add test that demonstrates the bug? -- This is an automated message from the Apache Git Service. To respond to the message

[GitHub] [lucene] benwtrent commented on pull request #12197: [Backport] GITHUB-11838 Add api to allow concurrent query rewrite

2023-03-23 Thread via GitHub
benwtrent commented on PR #12197: URL: https://github.com/apache/lucene/pull/12197#issuecomment-1481844580 I ran lucene-bench with `wikimedium10k` against this branch and 9x. There are some differences, but none seem statistically significant. ``` Int

[GitHub] [lucene] fthevenet commented on pull request #12212: Fixes Searches made via DrillSideways may miss documents that should match the query

2023-03-23 Thread via GitHub
fthevenet commented on PR #12212: URL: https://github.com/apache/lucene/pull/12212#issuecomment-1481849579 I _think_ it should be possible to create an index with only a few docs with only a couple of fields in it (a text field to do the phrase query on and a facet to enable drill down).

[GitHub] [lucene] uschindler commented on pull request #12197: [Backport] GITHUB-11838 Add api to allow concurrent query rewrite

2023-03-23 Thread via GitHub
uschindler commented on PR #12197: URL: https://github.com/apache/lucene/pull/12197#issuecomment-1481886324 Hi, Will check the virtual method code. Sorry was/am on business trip. Uwe -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [lucene] gsmiller commented on pull request #12212: Fixes Searches made via DrillSideways may miss documents that should match the query

2023-03-23 Thread via GitHub
gsmiller commented on PR #12212: URL: https://github.com/apache/lucene/pull/12212#issuecomment-1482041499 That's @fthevenet! Always great if we can repro in a test case. The fix looks right to me though. Thanks again! -- This is an automated message from the Apache Git Service. To respond

[GitHub] [lucene] hongyuyan97 opened a new issue, #12213: Ordered intervals over interleaved terms

2023-03-23 Thread via GitHub
hongyuyan97 opened a new issue, #12213: URL: https://github.com/apache/lucene/issues/12213 ### Description Here is my input text 'A B A C A B C' and search ORDERED(A, B, C) It should hits [0,3] and [4,6], but it will ignore the [4,6]. The reason is similar to this issue [LUCENE-

[GitHub] [lucene] hongyuyan97 opened a new pull request, #12214: fix ordered intervals query and add test case

2023-03-23 Thread via GitHub
hongyuyan97 opened a new pull request, #12214: URL: https://github.com/apache/lucene/pull/12214 ### Description This PR aims to address issue #12213 Ordered intervals over interleaved terms. A more detailed explanation of the issue and the reasoning behind the fix can be found in