mikemccand commented on PR #12688: URL: https://github.com/apache/lucene/pull/12688#issuecomment-1770448647
I'll try to review this soon -- it sounds compelling @Tony-X! I like how it is inspired by Tantivy's term dictionary format (which holds all terms + their metadata in RAM). Also, with the upcoming ability to [cleanly limit how much RAM the `FSTCompiler` is allowed to use to reduce the size of the FST](https://github.com/apache/lucene/pull/12633), this approach becomes more feasible. Without that change, the FST compilation might easily use excessive RAM during indexing when merging large segments.