jpountz commented on PR #14358:
URL: https://github.com/apache/lucene/pull/14358#issuecomment-2727467298

   I'm wary about adding all these micro-optimizations to reduce the 
per-segment per-field overhead. They hurt readability and may easily get lost 
over time when codecs get replaced with new codecs. Presumably, the motivation 
is to reduce heap usage when working with lots of shards / segments / fields? 
Could this be solved differently so that heap usage due to open indexes remains 
low by design rather than as a result of many micro-optimizations? E.g. by 
pooling index readers using a least-recently used policy, using a single Lucene 
field for multiple logical fields (like Elasticsearch's `flattened` does), 
merging segments more aggressively (e.g. by bumping the floor/min segment 
size), or a combination of all these things?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to