dungba88 commented on issue #12714:
URL: https://github.com/apache/lucene/issues/12714#issuecomment-1785150257

   I ran a small test to see how much RAM would be needed for a sample 
dictionary using a simple `LinkedHashMap<UnCompiledNode, Long>`:
   
   Cache size: 6 MB
   Items: 62457
   FST size: 977 KB
   
   The reported RAM might be slightly underestimated, and the cache appears to 
be about 6x the FST size. A byte-array representation would be more efficient 
than a naive `UnCompiledNode` (maybe down to 3x, as Mike has reported). The 
benefit of the map-based approach is simple implementation and maintenance.
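
   For illustration, a map-based LRU cache of this kind could be sketched as 
below. This is only a minimal sketch under assumptions: `NodeLruCache` and its 
methods are hypothetical names, the key type is left generic as a stand-in for 
`UnCompiledNode`, and the value is the node's address in the FST. It relies on 
`LinkedHashMap`'s access-order mode and `removeEldestEntry` hook.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical LRU cache from a node key to the address of the already
 * compiled node. K stands in for UnCompiledNode (an assumption; any key
 * with proper equals/hashCode works).
 */
final class NodeLruCache<K> {
  private final LinkedHashMap<K, Long> map;

  NodeLruCache(final int maxEntries) {
    // accessOrder = true keeps entries ordered from least to most
    // recently accessed, so the "eldest" entry is the LRU one.
    this.map = new LinkedHashMap<K, Long>(16, 0.75f, true) {
      @Override
      protected boolean removeEldestEntry(Map.Entry<K, Long> eldest) {
        // Called after each insertion; returning true evicts the LRU entry.
        return size() > maxEntries;
      }
    };
  }

  /** Returns the cached address, or null on a miss. A hit refreshes recency. */
  Long get(K key) {
    return map.get(key);
  }

  /** Records the address of a node, evicting the LRU entry if over capacity. */
  void put(K key, long address) {
    map.put(key, address);
  }

  int size() {
    return map.size();
  }
}
```

   Each entry here carries the key object, a boxed `Long`, and the map's own 
linked-entry overhead, which is consistent with the cache being several times 
larger than the FST itself.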
   
   Perhaps instead of `UnCompiledNode`, we could encode each node as a byte 
array (it could take the same format as the FST-encoded binary, but FST 
operations work on absolute addresses, so this would need some modification). 
In any case, this binary-encoded form could use a doubly linked list for an LRU 
cache as well. I'm unsure what a double-barrel cache would bring, as it would 
(at worst) have only 1/2 the normal capacity.
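
   To make the capacity concern concrete, here is a minimal sketch of a 
double-barrel cache, assuming the usual scheme: lookups check a primary map, 
then a secondary map (promoting hits), and when the primary fills up the 
secondary is discarded and the two are swapped. `DoubleBarrelCache` and its 
methods are hypothetical names, not an existing Lucene class, and the generic 
key is again a stand-in for the node representation.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical double-barrel cache; each barrel holds half the budget. */
final class DoubleBarrelCache<K> {
  private final int barrelSize;
  private Map<K, Long> primary = new HashMap<>();
  private Map<K, Long> secondary = new HashMap<>();

  DoubleBarrelCache(int totalCapacity) {
    // The total budget is split evenly between the two barrels.
    this.barrelSize = Math.max(1, totalCapacity / 2);
  }

  /** Checks primary, then secondary; a secondary hit is promoted. */
  Long get(K key) {
    Long address = primary.get(key);
    if (address == null) {
      address = secondary.get(key);
      if (address != null) {
        put(key, address); // promote to primary (may trigger a swap)
      }
    }
    return address;
  }

  /** Inserts into primary; when primary is full, the barrels are swapped. */
  void put(K key, long address) {
    primary.put(key, address);
    if (primary.size() >= barrelSize) {
      // The old secondary is dropped wholesale, the full primary becomes
      // the new secondary, and a fresh empty primary starts filling up.
      secondary = primary;
      primary = new HashMap<>();
    }
  }

  /** Stored entries across both barrels (a promoted key may count twice). */
  int entriesHeld() {
    return primary.size() + secondary.size();
  }
}
```

   Right after a swap the primary is empty, so the cache holds only 
`barrelSize` entries, i.e. half of the total budget. Eviction is also coarser 
than true LRU: a whole barrel is dropped at once rather than the single least 
recently used entry.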


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

