[GitHub] [lucene] zf853109035 opened a new issue, #12441: IndexSearcher.doc(int docID, Set fieldsToLoad) method is so slow?

2023-07-15 Thread via GitHub
zf853109035 opened a new issue, #12441: URL: https://github.com/apache/lucene/issues/12441 ### Description I created a file-related index and ten 1 MB files. When I did not store the file content, I ran the doc(int docID, Set fieldsToLoad) of the IndexSearcher class ten times, and th

[GitHub] [lucene] mkhludnev commented on issue #12441: IndexSearcher.doc(int docID, Set fieldsToLoad) method is so slow?

2023-07-15 Thread via GitHub
mkhludnev commented on issue #12441: URL: https://github.com/apache/lucene/issues/12441#issuecomment-1636702962 It's by design: the whole block of records needs to be decompressed and iterated through. Perhaps docValues (e.g. binary) might provide some sort of selectivity. -- This is an automa

[GitHub] [lucene] mkhludnev closed issue #12441: IndexSearcher.doc(int docID, Set fieldsToLoad) method is so slow?

2023-07-15 Thread via GitHub
mkhludnev closed issue #12441: IndexSearcher.doc(int docID, Set fieldsToLoad) method is so slow? URL: https://github.com/apache/lucene/issues/12441

[GitHub] [lucene] ChrisHegarty commented on pull request #12417: forutil add vectorized and scalar code

2023-07-15 Thread via GitHub
ChrisHegarty commented on PR #12417: URL: https://github.com/apache/lucene/pull/12417#issuecomment-1636713748 Apologies for my tardy and terse interaction here. I've been otherwise preoccupied. I hope to spend time on this soon.

[GitHub] [lucene] uschindler commented on pull request #12417: forutil add vectorized and scalar code

2023-07-15 Thread via GitHub
uschindler commented on PR #12417: URL: https://github.com/apache/lucene/pull/12417#issuecomment-1636714292 > Note that these benchmarks were running with jdk19 (not 20), so it's possible we'd see something different with 20? Lucene enables and compiles the vectorized code only for jd

[GitHub] [lucene] stefanvodita opened a new pull request, #12442: Assert IdxOrDvQuery subqueries and document useful fields

2023-07-15 Thread via GitHub
stefanvodita opened a new pull request, #12442: URL: https://github.com/apache/lucene/pull/12442 This is a follow-up from #12426. We introduce assertions in `TestIndexOrDocValuesQuery` that the two wrapped queries are behaving the same way and we document fields that produce indexed structu

[GitHub] [lucene] stefanvodita commented on a diff in pull request #12442: Assert IdxOrDvQuery subqueries and document useful fields

2023-07-15 Thread via GitHub
stefanvodita commented on code in PR #12442: URL: https://github.com/apache/lucene/pull/12442#discussion_r1264364625 ## lucene/test-framework/src/java/org/apache/lucene/tests/search/QueryUtils.java: ## @@ -675,7 +675,14 @@ public static void checkBulkScorerSkipTo(Random r, Query

[GitHub] [lucene] stefanvodita commented on pull request #12426: Introduce VerifyingQuery

2023-07-15 Thread via GitHub
stefanvodita commented on PR #12426: URL: https://github.com/apache/lucene/pull/12426#issuecomment-1636716527 Thank you for the suggestions for `IndexOrDocValuesQuery`! I’ve opened a separate [PR](https://github.com/apache/lucene/pull/12442) to address them. Let me know if it matches what y

[GitHub] [lucene] rmuir commented on pull request #12417: forutil add vectorized and scalar code

2023-07-15 Thread via GitHub
rmuir commented on PR #12417: URL: https://github.com/apache/lucene/pull/12417#issuecomment-1636842807 Please, let's not use this integer vectorization when `hasFastIntegerVectors` is false. Otherwise we can see a 30x or so slowdown on virtual machines without properly plumbed AVX.