[GitHub] [lucene] dweiss merged pull request #11993: Upgrade gradle to 7.6.

2022-12-02 Thread GitBox
dweiss merged PR #11993: URL: https://github.com/apache/lucene/pull/11993 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apac

[GitHub] [lucene] dweiss merged pull request #11994: Upgrade gradle to 7.6 (backport)

2022-12-02 Thread GitBox
dweiss merged PR #11994: URL: https://github.com/apache/lucene/pull/11994

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335235418 > @uschindler I think this issue just has a GC path of ThreadLocals. BUT, for instance in ES, when there is a 1000-shard node, and normally one shard with 40 segments per shard, one open

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335237178 In fact there is no situation where thousands of shards makes sense on a single node. That's bad design.

[GitHub] [lucene] benwtrent commented on issue #11963: Improve vector quantization API

2022-12-02 Thread GitBox
benwtrent commented on issue #11963: URL: https://github.com/apache/lucene/issues/11963#issuecomment-1335239951 @rmuir thank you! I currently have a major part of the refactor already written locally. But, it will be big and will need its own rounds of review to make sure it's the way we wan

[GitHub] [lucene] benwtrent commented on a diff in pull request #11860: GITHUB-11830 Better optimize storage for vector connections

2022-12-02 Thread GitBox
benwtrent commented on code in PR #11860: URL: https://github.com/apache/lucene/pull/11860#discussion_r1038154379 ## lucene/backward-codecs/src/java/org/apache/lucene/backward_codecs/lucene94/ExpandingVectorValues.java: ## @@ -0,0 +1,49 @@ +/* + * Licensed to the Apache Software

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335269730 I will investigate the document API and try to make a proposal so that threadlocal is no longer needed. I'm really concerned about the merge case here causing regressions for folks that de

[GitHub] [lucene] luyuncheng commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
luyuncheng commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335302726 > In fact there is no situation where thousands of shards makes sense on a single node. That's bad design. @rmuir I have another proposal: what do you think of making `ByteBuf

[GitHub] [lucene] jpountz commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
jpountz commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335358751 I had opened a very similar PR to this one at https://github.com/apache/lucene/pull/137 which handled the merge case.

[GitHub] [lucene] jpountz commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
jpountz commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335365755 I think I had not merged it because the follow-up discussion about removing thread locals had triggered naming/API concerns, but it should be a good incremental step and we could figure

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335391991 > I think I had not merged it because the follow-up discussion about removing thread locals had triggered naming/API concerns, but it should be a good incremental step and we could figure

[GitHub] [lucene] jpountz commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
jpountz commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335392646 Sounds good.

[GitHub] [lucene] jpountz closed pull request #137: LUCENE-9955: Reduced state of stored fields readers.

2022-12-02 Thread GitBox
jpountz closed pull request #137: LUCENE-9955: Reduced state of stored fields readers. URL: https://github.com/apache/lucene/pull/137

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335398330 it's just gonna take me some time, I can't get something out there like today. for example nearly 100% of tests would be impacted :) It is fair, I will feel the same pain the users will. Bu

[GitHub] [lucene] luyuncheng commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
luyuncheng commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335420862 Threadlocal just scales up the StoredFieldsReader's heap usage, BUT one instance with only 10K segments would use 1G heap memory in `fieldsReaderOrig#buffer` without threadlocal.

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335444854 the idea is not to have one instance per segment. there would be zero instances. when you want to retrieve docs from indexreader, the user would call .getFieldsReader() or similar to creat
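
A minimal sketch of the idea in the comment above, assuming a caller-owned reader that is created on demand and holds its own scratch buffer. Only the name `getFieldsReader()` comes from the comment; `Segment`, `FieldsReader`, `document` and the byte-copy standing in for decompression are hypothetical and are not Lucene's API.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

/** Hypothetical sketch: no reader state is cached per segment or per thread. */
public class PerCallReaderSketch {

  /** Stand-in for a segment's stored documents (illustrative, not a Lucene class). */
  public static final class Segment {
    private final List<byte[]> docs;

    public Segment(List<byte[]> docs) {
      this.docs = docs;
    }

    /** Creates a fresh reader on demand; the segment keeps no reference to it. */
    public FieldsReader getFieldsReader() {
      return new FieldsReader(docs);
    }
  }

  /** Short-lived reader that owns its scratch buffer. */
  public static final class FieldsReader {
    private final List<byte[]> docs;
    // Grows on demand and becomes garbage together with the reader.
    private byte[] buffer = new byte[0];

    FieldsReader(List<byte[]> docs) {
      this.docs = docs;
    }

    public String document(int docId) {
      byte[] raw = docs.get(docId);
      if (buffer.length < raw.length) {
        buffer = new byte[raw.length];
      }
      // The copy stands in for decompressing into the reusable buffer.
      System.arraycopy(raw, 0, buffer, 0, raw.length);
      return new String(buffer, 0, raw.length, StandardCharsets.UTF_8);
    }
  }

  public static void main(String[] args) {
    Segment segment = new Segment(List.of(
        "doc zero".getBytes(StandardCharsets.UTF_8),
        "doc one".getBytes(StandardCharsets.UTF_8)));
    // The caller owns the reader and can reuse it for a batch of lookups.
    FieldsReader reader = segment.getFieldsReader();
    System.out.println(reader.document(0));
    System.out.println(reader.document(1));
  }
}
```

In this sketch the buffer's lifetime is tied to the reader the caller holds, so an idle segment retains no buffer at all.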

[GitHub] [lucene] rmuir commented on pull request #11987: Make Decompressor release memory buffer

2022-12-02 Thread GitBox
rmuir commented on PR #11987: URL: https://github.com/apache/lucene/pull/11987#issuecomment-1335456484 and one idea i have is to try to prototype with the term vectors first (since both stored fields and term vectors have per-segment threadlocals that I'd like to remove). it is just less te

[GitHub] [lucene] dnhatn commented on pull request #11984: Add exponential growth to TimeLimitingBulkScorer

2022-12-02 Thread GitBox
dnhatn commented on PR #11984: URL: https://github.com/apache/lucene/pull/11984#issuecomment-1335560505 Thanks @costin!

[GitHub] [lucene] dnhatn merged pull request #11984: Add exponential growth to TimeLimitingBulkScorer

2022-12-02 Thread GitBox
dnhatn merged PR #11984: URL: https://github.com/apache/lucene/pull/11984

[GitHub] [lucene] dnhatn closed issue #11676: Can TimeLimitingBulkScorer exponentially grow the window size? [LUCENE-10640]

2022-12-02 Thread GitBox
dnhatn closed issue #11676: Can TimeLimitingBulkScorer exponentially grow the window size? [LUCENE-10640] URL: https://github.com/apache/lucene/issues/11676
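
As a rough illustration of what an exponentially growing window means for a time-limited scorer, the sketch below checks a timeout once per window and doubles the window after each one. It is not the code from the PR: the class and method names, the initial window of 100, the doubling factor and the exception type are all assumptions made for the example.

```java
import java.util.function.BooleanSupplier;
import java.util.function.IntConsumer;

/** Hypothetical sketch of window-by-window scoring with a growing window. */
public class GrowingWindowSketch {
  // Assumed starting window size, chosen for illustration only.
  static final int INITIAL_WINDOW = 100;

  /**
   * Scores doc IDs in [min, max) one window at a time, checking the timeout
   * before each window. Returns the number of timeout checks performed.
   */
  public static int scoreWithGrowingWindow(
      int min, int max, IntConsumer scoreDoc, BooleanSupplier timedOut) {
    int window = INITIAL_WINDOW;
    int checks = 0;
    while (min < max) {
      checks++;
      if (timedOut.getAsBoolean()) {
        throw new IllegalStateException("time limit exceeded before doc " + min);
      }
      // Long arithmetic avoids int overflow when min + window is large.
      int end = (int) Math.min((long) min + window, max);
      for (int doc = min; doc < end; doc++) {
        scoreDoc.accept(doc);
      }
      min = end;
      // Double the window (assumed growth factor), capped at Integer.MAX_VALUE.
      window = (int) Math.min((long) window * 2, Integer.MAX_VALUE);
    }
    return checks;
  }

  public static void main(String[] args) {
    int[] scored = {0};
    int checks = scoreWithGrowingWindow(0, 1000, doc -> scored[0]++, () -> false);
    System.out.println(scored[0] + " docs scored with " + checks + " timeout checks");
  }
}
```

With these assumed numbers, 1000 documents are covered by windows of 100, 200, 400 and a final capped window, so the timeout is checked 4 times instead of the 10 a fixed window of 100 would need.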

[GitHub] [lucene] jpountz commented on a diff in pull request #11860: GITHUB-11830 Better optimize storage for vector connections

2022-12-02 Thread GitBox
jpountz commented on code in PR #11860: URL: https://github.com/apache/lucene/pull/11860#discussion_r1038244235 ## lucene/backward-codecs/src/java/org/apache/lucene/backward_codecs/lucene94/ExpandingVectorValues.java: ## @@ -0,0 +1,49 @@ +/* + * Licensed to the Apache Software F

[GitHub] [lucene] benwtrent commented on a diff in pull request #11860: GITHUB-11830 Better optimize storage for vector connections

2022-12-02 Thread GitBox
benwtrent commented on code in PR #11860: URL: https://github.com/apache/lucene/pull/11860#discussion_r1038415190 ## lucene/core/src/java/org/apache/lucene/codecs/lucene95/Lucene95HnswVectorsWriter.java: ## @@ -0,0 +1,753 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] [lucene] benwtrent commented on a diff in pull request #11860: GITHUB-11830 Better optimize storage for vector connections

2022-12-02 Thread GitBox
benwtrent commented on code in PR #11860: URL: https://github.com/apache/lucene/pull/11860#discussion_r1038420089 ## lucene/core/src/java/org/apache/lucene/util/hnsw/HnswGraph.java: ## @@ -144,6 +145,10 @@ public NodesIterator(int size) { this.size = size; } +p