Do you have indexing traffic going to it? b/c this _looks_
like the node is just starting up or a searcher is
being opened and you're loading your
index first time. This happens when you index data and
when you start up your nodes. Adding some autowarming
(firstSearcher in this case) might load up the underlying
caches earlier. This could also be a problem due to
very short commit intervals, although this latter should
be identical for both nodes.

And when you say 2 solr nodes, is this one shard or two?

I'm guessing that you have some setting that's significantly
different, memory perhaps?

Best,
Erick



On Wed, Sep 3, 2014 at 2:40 PM, Ethan <eh198...@gmail.com> wrote:
> Forgot to add the source thread thats blocking every other thread
>
>
> "http-bio-52158-exec-61" - Thread t@591
>    java.lang.Thread.State: RUNNABLE
>  at
> org.apache.lucene.search.FieldCacheImpl$Uninvert.uninvert(FieldCacheImpl.java:312)
> at
> org.apache.lucene.search.FieldCacheImpl$LongCache.createValue(FieldCacheImpl.java:986)
>  at
> org.apache.lucene.search.FieldCacheImpl$Cache.get(FieldCacheImpl.java:212)
> - locked org.apache.lucene.search.FieldCache$CreationPlaceholder@29e0400b
>  at
> org.apache.lucene.search.FieldCacheImpl.getLongs(FieldCacheImpl.java:901)
> at
> org.apache.lucene.search.FieldComparator$LongComparator.setNextReader(FieldComparator.java:685)
>  at
> org.apache.lucene.search.TopFieldCollector$OneComparatorNonScoringCollector.setNextReader(TopFieldCollector.java:97)
> at
> org.apache.lucene.search.TimeLimitingCollector.setNextReader(TimeLimitingCollector.java:158)
>  at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:618)
> at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:297)
>  at
> org.apache.solr.search.SolrIndexSearcher.getDocListNC(SolrIndexSearcher.java:1501)
> at
> org.apache.solr.search.SolrIndexSearcher.getDocListC(SolrIndexSearcher.java:1367)
>  at
> org.apache.solr.search.SolrIndexSearcher.search(SolrIndexSearcher.java:474)
> at
> org.apache.solr.handler.component.QueryComponent.process(QueryComponent.java:434)
>  at
> org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
> at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
>  at org.apache.solr.core.SolrCore.execute(SolrCore.java:1859)
> at
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:703)
>  at
> com.trimp.search.filter.LogAndAuthFilter.execute(LogAndAuthFilter.scala:109)
> at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:406)
>  at
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:195)
> at
> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
>  at
> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
> at
> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:222)
>  at
> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:123)
> at
> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:171)
>  at
> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:99)
> at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:947)
>  at org.apache.catalina.valves.RemoteIpValve.invoke(RemoteIpValve.java:680)
> at
> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:118)
>  at
> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
> at
> org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1009)
>  at
> org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:589)
> at
> org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:310)
>  - locked org.apache.tomcat.util.net.SocketWrapper@7826692
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>  at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
> at java.lang.Thread.run(Thread.java:722)
>
>    Locked ownable synchronizers:
> - locked java.util.concurrent.ThreadPoolExecutor$Worker@2463aef
>
>
> On Wed, Sep 3, 2014 at 2:31 PM, Ethan <eh198...@gmail.com> wrote:
>
>> We have SolrCloud instance with 2 solr nodes and 3 zk ensemble.  One of
>> the solr node goes down as soon as we send search traffic to it, but update
>> works fine.
>>
>> When I analyzed thread dump I saw lot of blocked threads with following
>> error message.  This explains why it couldn't create any native threads and
>> ran out of memory.  The thread count went from 48 to 900 within minutes and
>> server came down.  The other node with same configuration is taking all the
>> search and update traffic, and it running fine.
>>
>> Any pointers would be appreciated.
>>
>> http-bio-52158-exec-59" - Thread t@589
>>    java.lang.Thread.State: BLOCKED on
>> org.apache.lucene.search.FieldCache$CreationPlaceholder@29e0400b owned
>> by: http-bio-52158-exec-61
>>  at
>> org.apache.lucene.search.FieldCacheImpl$Cache.get(FieldCacheImpl.java:209)
>> at
>> org.apache.lucene.search.FieldCacheImpl.getLongs(FieldCacheImpl.java:901)
>>  at
>> org.apache.lucene.search.FieldComparator$LongComparator.setNextReader(FieldComparator.java:685)
>> at
>> org.apache.lucene.search.TopFieldCollector$OneComparatorNonScoringCollector.setNextReader(TopFieldCollector.java:97)
>>  at
>> org.apache.lucene.search.TimeLimitingCollector.setNextReader(TimeLimitingCollector.java:158)
>> at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:618)
>>  at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:297)
>> at
>> org.apache.solr.search.SolrIndexSearcher.getDocListNC(SolrIndexSearcher.java:1501)
>>  at
>> org.apache.solr.search.SolrIndexSearcher.getDocListC(SolrIndexSearcher.java:1367)
>> at
>> org.apache.solr.search.SolrIndexSearcher.search(SolrIndexSearcher.java:474)
>>  at
>> org.apache.solr.handler.component.QueryComponent.process(QueryComponent.java:434)
>> at
>> org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:208)
>>  at
>> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
>> at org.apache.solr.core.SolrCore.execute(SolrCore.java:1859)
>>  at
>> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:703)
>> at
>> com.trimp.search.filter.LogAndAuthFilter.execute(LogAndAuthFilter.scala:109)
>>  at
>> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:406)
>> at
>> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:195)
>>  at
>> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
>> at
>> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
>>  at
>> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:222)
>> at
>> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:123)
>>  at
>> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:171)
>> at
>> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:99)
>>  at
>> org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:947)
>> at org.apache.catalina.valves.RemoteIpValve.invoke(RemoteIpValve.java:680)
>>  at
>> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:118)
>> at
>> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:408)
>>  at
>> org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1009)
>> at
>> org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:589)
>>  at
>> org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:310)
>> - locked org.apache.tomcat.util.net.SocketWrapper@5b4530c8
>>  at
>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>> at
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>>  at java.lang.Thread.run(Thread.java:722)
>>
>>    Locked ownable synchronizers:
>> - locked java.util.concurrent.ThreadPoolExecutor$Worker@63d2720
>>
>> -E
>>

Reply via email to