Hello Björn, Take great care, 7.2.1 cannot read an index written by 7.4.0, so you cannot roll back but need to reindex!
Andrey Kudryavtsev made a good suggestion in the thread on how to find the culprit, but it will be a tedious task. I have not yet had the time or courage to venture there. Hope it helps, Markus -----Original message----- > From:Björn Häuser <bjoernhaeu...@gmail.com> > Sent: Monday 3rd September 2018 22:28 > To: solr-user@lucene.apache.org > Subject: Re: Heap Memory Problem after Upgrading to 7.4.0 > > Hi Markus, > > this reads exactly like what we have. Where you able to figure out anything? > Currently thinking about rollbacking to 7.2.1. > > > > > On 3. Sep 2018, at 21:54, Markus Jelsma <markus.jel...@openindex.io> wrote: > > > > Hello, > > > > Getting an OOM plus the fact you are having a lot of IndexSearcher > > instances rings a familiar bell. One of our collections has the same issue > > [1] when we attempted an upgrade 7.2.1 > 7.3.0. I managed to rule out all > > our custom Solr code but had to keep our Lucene filters in the schema, the > > problem persisted. > > > > The odd thing, however, is that you appear to have the same problem, but > > not with 7.3.0? Since you shortly after 7.3.0 upgraded to 7.4.0, can you > > confirm the problem is not also in 7.3.0? > > > > We had very similar problems with 7.3.0 but never analyzed them and just > updated to 7.4.0 because I thought thats the bug we hit: > https://issues.apache.org/jira/browse/SOLR-11882 > <https://issues.apache.org/jira/browse/SOLR-11882> > > > > You should see the instance count for IndexSearcher increase by one for > > each replica on each commit. > > > Sorry, where can I find this? ;) Sorry, did not find anything. > > Thanks > Björn > > > > > Regards, > > Markus > > > > [1] > > http://lucene.472066.n3.nabble.com/RE-7-3-appears-to-leak-td4396232.html > > > > > > > > -----Original message----- > >> From:Erick Erickson <erickerick...@gmail.com> > >> Sent: Monday 3rd September 2018 20:49 > >> To: solr-user <solr-user@lucene.apache.org> > >> Subject: Re: Heap Memory Problem after Upgrading to 7.4.0 > >> > >> I would expect at least 1 IndexSearcher per replica, how many total > >> replicas hosted in your JVM? > >> > >> Plus, if you're actively indexing, there may temporarily be 2 > >> IndexSearchers open while the new searcher warms. > >> > >> And there may be quite a few caches, at least queryResultCache and > >> filterCache and documentCache, one of each per replica and maybe two > >> (for queryResultCache and filterCache) if you have a background > >> searcher autowarming. > >> > >> At a glance, your autowarm counts are very high, so it may take some > >> time to autowarm leading to multiple IndexSearchers and caches open > >> per replica when you happen to hit a commit point. I usually start > >> with 16-20 as an autowarm count, the benefit decreases rapidly as you > >> increase the count. > >> > >> I'm not quite sure why it would be different in 7x .vs. 6x. How much > >> heap do you allocate to the JVM? And do you see similar heap dumps in > >> 6.6? > >> > >> Best, > >> Erick > >> On Mon, Sep 3, 2018 at 10:33 AM Björn Häuser <bjoernhaeu...@gmail.com> > >> wrote: > >>> > >>> Hello, > >>> > >>> we recently upgraded our solrcloud (5 nodes, 25 collections, 1 shard > >>> each, 4 replicas each) from 6.6.0 to 7.3.0 and shortly after to 7.4.0. We > >>> are running Zookeeper 4.1.13. > >>> > >>> Since the upgrade to 7.3.0 and also 7.4.0 we encountering heap space > >>> exhaustion. After obtaining a heap dump it looks like that we have a lot > >>> of IndexSearchers open for our largest collection. > >>> > >>> The dump contains around ~60 IndexSearchers, and each containing around > >>> ~40mb heap. Another 500MB of heap is the fieldcache, which is expected in > >>> my opinion. > >>> > >>> The current config can be found here: > >>> https://gist.github.com/bjoernhaeuser/327a65291ac9793e744b87f0a561e844 > >>> <https://gist.github.com/bjoernhaeuser/327a65291ac9793e744b87f0a561e844> > >>> > >>> Analyzing the heap dump eclipse MAT says this: > >>> > >>> Problem Suspect 1 > >>> > >>> 91 instances of "org.apache.solr.search.SolrIndexSearcher", loaded by > >>> "org.eclipse.jetty.webapp.WebAppClassLoader @ 0x6807d1048" occupy > >>> 1.981.148.336 (38,26%) bytes. > >>> > >>> Biggest instances: > >>> > >>> • org.apache.solr.search.SolrIndexSearcher @ 0x6ffd47ea8 - > >>>70.087.272 (1,35%) bytes. > >>> • org.apache.solr.search.SolrIndexSearcher @ 0x79ea9c040 - > >>>65.678.264 (1,27%) bytes. > >>> • org.apache.solr.search.SolrIndexSearcher @ 0x6855ad680 - > >>>63.050.600 (1,22%) bytes. > >>> > >>> > >>> Problem Suspect 2 > >>> > >>> 223 instances of "org.apache.solr.util.ConcurrentLRUCache", loaded by > >>> "org.eclipse.jetty.webapp.WebAppClassLoader @ 0x6807d1048" occupy > >>> 1.373.110.208 (26,52%) bytes. > >>> > >>> > >>> Any help is appreciated. Thank you very much! > >>> Björn > >> > >