Re: SolrCloud unstable

2013-12-10 Thread Furkan KAMACI
Hi Martin, You can change your Java version from 1.6 to 1.7 u25 and test again to see whether the problem is related to the Java version. Thanks, Furkan KAMACI 2013/11/24 Lance Norskog > Yes, you should use a recent Java 7. Java 6 is end-of-life and no longer > supported by Oracle. Also, read u

Re: SolrCloud unstable

2013-11-24 Thread Lance Norskog
Yes, you should use a recent Java 7. Java 6 is end-of-life and no longer supported by Oracle. Also, read up on the various garbage collectors. It is a complex topic and there are many guides online. In particular there is a problem in some Java 6 releases that causes a massive memory leak in S

Re: SolrCloud unstable

2013-11-22 Thread Martin de Vries
We did some more monitoring and have some new information: Before the issue happens the garbage collector's "collection count" increases a lot. The increase seems to start about an hour before the real problem occurs: http://www.analyticsforapplications.com/GC.png [1] We tried both the g1
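The "collection count" mentioned above is exposed by the JVM through the standard `java.lang.management` API, so the trend can be tracked without a graphing tool. The following is only a minimal sketch, not what was used in this thread: the class name `GcCountMonitor`, the sampling interval, and the alert threshold are all hypothetical, and the code samples the JVM it runs in. Watching a Solr node this way would mean reading the same MXBeans over a remote JMX connection, which is not shown here.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

public class GcCountMonitor {

    /** Sums the collection counts of all garbage collectors in this JVM. */
    static long totalCollectionCount() {
        long total = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount(); // -1 when the collector does not report a count
            if (count > 0) {
                total += count;
            }
        }
        return total;
    }

    /** Converts the difference between two samples into collections per minute. */
    static double ratePerMinute(long previous, long current, long intervalMillis) {
        return (current - previous) * 60000.0 / intervalMillis;
    }

    public static void main(String[] args) throws InterruptedException {
        long intervalMillis = 1000;    // hypothetical sampling interval
        double alertThreshold = 120.0; // hypothetical threshold, tune per installation
        long previous = totalCollectionCount();
        for (int i = 0; i < 3; i++) {
            Thread.sleep(intervalMillis);
            long current = totalCollectionCount();
            double rate = ratePerMinute(previous, current, intervalMillis);
            String flag = rate > alertThreshold ? " (above threshold)" : "";
            System.out.println("GC collections/min: " + rate + flag);
            previous = current;
        }
    }
}
```

Logging the rate rather than the raw counter makes a sustained rise, like the one described as starting about an hour before the failure, easier to spot.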

Re: SolrCloud unstable

2013-11-12 Thread yriveiro
Some time ago I posted this issue: http://lucene.472066.n3.nabble.com/Leader-election-fails-in-some-point-td4096514.html The link to the screenshot is no longer available. When a shard fails and loses its leader, I get those exceptions. - Best regards -- View this message in context: http://l

Re: SolrCloud unstable

2013-11-12 Thread Henrik Ossipoff Hansen
Hello, I’m experiencing a similar issue, but with much smaller indexes, although with much higher disk latency during backup sessions on our NFS. I have a feeling the solution could be the same, so I’ll leave my story here in case it helps; no solution found yet. http://lucene.472

Re: SolrCloud unstable

2013-11-12 Thread Yago Riveiro
Hi Martin, I see the same behaviour you are describing with a very similar setup: 6 machines, ~50 shards with a replicationFactor of two. The most critical issue IMHO is that failover doesn't work, because one node is down and the other is in recovery mode. In log I can se