Re: [PR] Random access term dictionary [lucene]

via GitHub Thu, 19 Oct 2023 02:45:18 -0700


mikemccand commented on PR #12688:
URL: https://github.com/apache/lucene/pull/12688#issuecomment-1770448647


   I'll try to review this soon -- it sounds compelling @Tony-X!  I like how it 
is inspired by Tantivy's term dictionary format (which holds all terms + their 
metadata in RAM).
   
   Also, with the upcoming ability to [cleanly limit how much RAM the 
`FSTCompiler` is allowed to use to reduce the size of the 
FST](https://github.com/apache/lucene/pull/12633), this approach becomes more 
feasible.  Without that change, the FST compilation might easily use excessive 
RAM during indexing when merging large segments.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [PR] Random access term dictionary [lucene]

Reply via email to