Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2025-02-22 Thread via GitHub
github-actions[bot] commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2676466824 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2025-02-07 Thread via GitHub
mikemccand commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1946542491 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMap.java: ## @@ -218,12 +231,26 @@ public void add(CharsRef input, CharsRef output, boo

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2025-02-07 Thread via GitHub
mikemccand commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2642948906 Hi @msfroh -- thank you for the ping! Sorry for the slow reply ... I'll try to review again soon, and we might be able to test impact in our Amazon product search `SynonymGraphFilter

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2025-01-25 Thread via GitHub
github-actions[bot] commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2614148431 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2025-01-10 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2582003125 @mikemccand -- do you think this needs more work? Can you work with your team to see if this change would help reduce your heap usage? While we allow custom synonym files on AWS Ope

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-16 Thread via GitHub
github-actions[bot] commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2294482584 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1701032411 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1701025192 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1701014637 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1700987077 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1700999887 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMap.java: ## @@ -291,11 +306,35 @@ public SynonymMap build() throws IOException {

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-08-01 Thread via GitHub
msfroh commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1700987077 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-07-23 Thread via GitHub
mikemccand commented on code in PR #13054: URL: https://github.com/apache/lucene/pull/13054#discussion_r1687872625 ## lucene/analysis/common/src/java/org/apache/lucene/analysis/synonym/SynonymMapDirectory.java: ## @@ -0,0 +1,191 @@ +/* + * Licensed to the Apache Software Foundat

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-07-22 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2243997751 Note: The above PR has been merged -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-07-07 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2213024535 I published a PR here: https://github.com/apache/lucene/pull/13549. Please take a look when you have time! -- This is an automated message from the Apache Git Service. To respond to t

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-07-07 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2212975749 @msfroh I also forgot about this. Let me create a PR -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-06-08 Thread via GitHub
github-actions[bot] commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2156239861 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-05-24 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-2130737374 @dungba88 - I forgot about this change for a while. Did you create a separate PR for the saveMetadata change? Should I? -- This is an automated message from the Apache Git Service. To r

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-03-12 Thread via GitHub
github-actions[bot] commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1992798063 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-27 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1967141003 > I realized I also need the saveMetadata change for https://github.com/apache/lucene/pull/12985. Do you think we should make it a standalone PR and merge first? Otherwise I've cherry-pic

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-27 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1966724748 I realized I also need the `saveMetadata` change for https://github.com/apache/lucene/pull/12985. Do you think we should make it a standalone PR and merge first? Otherwise I've cherry-p

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-25 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1963312776 > I could put a PR for the saveMetadata change if you prefer. I'll update to take care of that. Thanks for the pointers! -- This is an automated message from the Apache Git Servic

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-25 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1963208003 I could put a PR for the saveMetadata change if you prefer. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-25 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1963122608 You are right, the saveMetadata is still in FST. Now to create the FST written off heap, you need to create the corresponding DataInput and use the FST constructor. However

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-25 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1963078077 > @msfroh > > As you only need to write the FST metadata, there is no need to create the FST. You can just call > > ``` > directory.writeMetadata(words.size(), maxHorizont

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-24 Thread via GitHub
dungba88 commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1962302474 @msfroh As you only need to write the FST metadata, there is no need to create the FST. You can just call ``` directory.writeMetadata(words.size(), maxHorizontalContex

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-23 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1961803620 @dungba88 -- I'm trying to resolve conflicts with your changes, but I'm a little stuck. I don't understand how we're supposed to use the FST APIs to write the FST to disk now. Aft

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-02-09 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1935491971 I decided to try experimenting with moving the output words back onto the heap, since I didn't the fact that every word lookup was triggering a seek. Running now, I got way less va

Re: [PR] Move synonym map off-heap for SynonymGraphFilter [lucene]

2024-01-31 Thread via GitHub
msfroh commented on PR #13054: URL: https://github.com/apache/lucene/pull/13054#issuecomment-1920420714 I did some rough benchmarks using the large synonym file attached to https://issues.apache.org/jira/browse/LUCENE-3233 The benchmark code and input is at https://github.com/msfroh/