https://issues.apache.org/bugzilla/show_bug.cgi?id=45261

           Summary: Concurrent node failure leads to inconsistent views.
           Product: Tomcat 6
           Version: 6.0.16
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Cluster
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


Created an attachment (id=22166)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=22166)
Demonstrate view inconsistency.

In a four node cluster, using NonBlockingCoordinator, if two nodes fail at the
same time, the remaining two nodes get different views and never converge. When
the other nodes restart, they never install a view at all.

I've attached the relevant demo code. Run it on 4 machines, wait for view
installation, then CTRL-C two of them. The other two will never print the same
UniqueId. Start a new node, view is always null.

Immediately after the two node failure, one of the surviving nodes issues this
stack trace:

WARN - Member send is failing for:tcp://{-64, -88, -91, 34}:4000 ; Setting to suspect and retrying.
ERROR - Error processing coordination message. Could be fatal.
org.apache.catalina.tribes.ChannelException: Send failed, attempt:2 max:1; Faulty members:tcp://{-64, -88, -91, 34}:4000;
        at org.apache.catalina.tribes.transport.nio.ParallelNioSender.doLoop(ParallelNioSender.java:172)
        at org.apache.catalina.tribes.transport.nio.ParallelNioSender.sendMessage(ParallelNioSender.java:78)
        at org.apache.catalina.tribes.transport.nio.PooledParallelSender.sendMessage(PooledParallelSender.java:53)
        at org.apache.catalina.tribes.transport.ReplicationTransmitter.sendMessage(ReplicationTransmitter.java:80)
        at org.apache.catalina.tribes.group.ChannelCoordinator.sendMessage(ChannelCoordinator.java:78)
        at org.apache.catalina.tribes.group.ChannelInterceptorBase.sendMessage(ChannelInterceptorBase.java:75)
        at org.apache.catalina.tribes.group.interceptors.NonBlockingCoordinator.handleMyToken(NonBlockingCoordina
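
The check behind this reproduction is a comparison of the view id each surviving node prints: the views have converged only if every node reports an id and all ids are identical. Below is a minimal sketch of that comparison, assuming the printed ids have been collected as raw bytes keyed by node name. ViewConvergenceCheck, converged and the node names are hypothetical; they are not part of Tomcat or of the attached demo code.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Tomcat or the attached demo.
final class ViewConvergenceCheck {

    // Returns true only when every node reports a non-null view id
    // and all reported ids are byte-for-byte equal.
    static boolean converged(Map<String, byte[]> viewIdByNode) {
        byte[] first = null;
        for (byte[] id : viewIdByNode.values()) {
            if (id == null) {
                return false; // this node never installed a view
            }
            if (first == null) {
                first = id;
            } else if (!Arrays.equals(first, id)) {
                return false; // two nodes disagree on the current view
            }
        }
        return first != null; // an empty map has nothing to agree on
    }

    public static void main(String[] args) {
        // Assumed sample data standing in for the ids printed by two survivors.
        Map<String, byte[]> reported = new LinkedHashMap<>();
        reported.put("nodeA", new byte[] {1, 2, 3});
        reported.put("nodeB", new byte[] {1, 2, 4});
        System.out.println("converged: " + converged(reported));
    }
}
```

In the failure described here, the two survivors would report different ids, and a restarted node would report none at all; both cases make the check return false.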