Adrien Grand created LUCENE-9994:
------------------------------------

             Summary: Can IndexingChain better protect against large documents?
                 Key: LUCENE-9994
                 URL: https://issues.apache.org/jira/browse/LUCENE-9994
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Adrien Grand


It's easy for a single document to use several times the amount of RAM that is 
configured on IndexWriter by having many fields or many terms on a single 
field. Could we improve IndexingChain to reject such documents before they may 
cause an out-of-memory error? We could make such documents born deleted in the 
new segment like we already do when consuming a TokenStream raises an exception.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to