[ 
https://issues.apache.org/jira/browse/GEODE-6309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16750214#comment-16750214
 ] 

Bruce Schuchardt commented on GEODE-6309:
-----------------------------------------

I suspected that this was a problem with the test but it isn't.  Logs for the 
failed run show that the reconnecting Locator tried to rejoin the distributed 
system but the server that took over the role of membership coordinator was 
still sorting out the two failures (Locator and server-2) and did not respond 
to the join request quickly enough, causing the reconnecting Locator to form a 
new cluster w/o that server.

> ClusterConfigLocatorRestartDUnitTest fails to spin up a new server
> ------------------------------------------------------------------
>
>                 Key: GEODE-6309
>                 URL: https://issues.apache.org/jira/browse/GEODE-6309
>             Project: Geode
>          Issue Type: Bug
>          Components: membership
>            Reporter: Bruce Schuchardt
>            Assignee: Bruce Schuchardt
>            Priority: Major
>
> One of this class's tests starts a locator and two servers, then it 
> force-disconnects the locator and one of the servers & waits for the locator 
> to reconnect.  After that it starts a third server and expects it to join the 
> cluster but this failed to happen in CI run 316:
> {noformat}
> > Task :geode-core:distributedTest
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
>  > serverRestartsAfterLocatorReconnects FAILED
>     org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.rules.ClusterStartupRule$$Lambda$46/1297938526.call
>  in VM 3 running on Host 74139c18c4e4 with 5 VMs
>         at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:533)
>         at org.apache.geode.test.dunit.VM.invoke(VM.java:390)
>         at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:239)
>         at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:232)
>         at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:218)
>         at 
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartsAfterLocatorReconnects(ClusterConfigLocatorRestartDUnitTest.java:71)
>         Caused by:
>         org.apache.geode.SystemConnectException: Unable to join the 
> distributed system in 60032ms
> {noformat}
>  
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/316]
> SHA: 654dc3bac3e50e66f33385bdbc38c88750061aa9



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to