[ https://issues.apache.org/jira/browse/GEODE-6107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16710517#comment-16710517 ]
ASF subversion and git services commented on GEODE-6107: -------------------------------------------------------- Commit 6a0e08b034d1b57b5e680f67c7c4bbe918a1f053 in geode's branch refs/heads/feature/GEODE-6149 from [~balesh2] [ https://gitbox.apache.org/repos/asf?p=geode.git;h=6a0e08b ] GEODE-6107: in test start locators one at a time (#2933) * GEODE-6107: in test start locators one at a time There exists a known race condition in starting locators such that if two locators are started simultaneously, not all the MBeans get federated to both members. This was causing intermittent test failures in the before() of JMXMBeanReconnectDUnitTest. To fix this, a wait has been added so that the first member has enough time to start before the second one. A wait has also been added after starting locator2 so that the locator has enough time to start before attempting to establish a JMX connection. If the locator has not fully started, connecting to the MBean server will result in an exception. > CI Failure: org.apache.geode.management.JMXMBeanReconnectDUnitTest > > testRemoteBeanKnowledge_MaintainServerAndCrashLocator > -------------------------------------------------------------------------------------------------------------------------- > > Key: GEODE-6107 > URL: https://issues.apache.org/jira/browse/GEODE-6107 > Project: Geode > Issue Type: Bug > Reporter: Ryan McMahon > Assignee: Helena Bales > Priority: Major > Labels: pull-request-available > Fix For: 1.9.0 > > Time Spent: 1h 20m > Remaining Estimate: 0h > > Build: > https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/172 > Results: > http://files.apachegeode-ci.info/builds/apache-develop-main/1.9.0-build.210/test-results/distributedTest/1543449109/ > Artifacts: > http://files.apachegeode-ci.info/builds/apache-develop-main/1.9.0-build.210/test-artifacts/1543449109/distributedtestfiles-OpenJDK8-1.9.0-build.210.tgz > {noformat}org.apache.geode.management.JMXMBeanReconnectDUnitTest > > testRemoteBeanKnowledge_MaintainServerAndCrashLocator FAILED > org.awaitility.core.ConditionTimeoutException: Condition with alias > 'Locators must agree on the state of the system' didn't complete within 300 > seconds because assertion condition defined as a lambda expression in > org.apache.geode.management.JMXMBeanReconnectDUnitTest > Expecting: > <[GemFire:service=Region,name="/test-region-1",type=Distributed, > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed, > GemFire:service=AccessControl,type=Distributed, > GemFire:service=FileUploader,type=Distributed, > GemFire:service=System,type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one, > GemFire:service=Locator,type=Member,member=locator-one, > GemFire:type=Member,member=locator-one, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-two, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-two, > GemFire:service=Locator,type=Member,member=locator-two, > GemFire:type=Member,member=locator-two, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-2, > GemFire:service=CacheServer,port=33929,type=Member,member=server-2, > GemFire:type=Member,member=server-2, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, > GemFire:service=CacheServer,port=46497,type=Member,member=server-3, > GemFire:type=Member,member=server-3]> > to contain exactly (and in same order): > <[GemFire:service=Region,name="/test-region-1",type=Distributed, > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed, > GemFire:service=AccessControl,type=Distributed, > GemFire:service=FileUploader,type=Distributed, > GemFire:service=System,type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one, > GemFire:service=Locator,type=Member,member=locator-one, > GemFire:type=Member,member=locator-one, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-two, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-two, > GemFire:service=Locator,type=Member,member=locator-two, > GemFire:type=Member,member=locator-two, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-2, > GemFire:service=CacheServer,port=33929,type=Member,member=server-2, > GemFire:type=Member,member=server-2, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, > GemFire:service=CacheServer,port=46497,type=Member,member=server-3, > GemFire:type=Member,member=server-3]> > but some elements were not expected: > > <[GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one]> > . > at > org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:145) > at > org.awaitility.core.AssertionCondition.await(AssertionCondition.java:122) > at > org.awaitility.core.AssertionCondition.await(AssertionCondition.java:32) > at > org.awaitility.core.ConditionFactory.until(ConditionFactory.java:902) > at > org.awaitility.core.ConditionFactory.untilAsserted(ConditionFactory.java:723) > at > org.apache.geode.management.JMXMBeanReconnectDUnitTest.before(JMXMBeanReconnectDUnitTest.java:105) > Caused by: > java.lang.AssertionError: > Expecting: > <[GemFire:service=Region,name="/test-region-1",type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed, > GemFire:service=AccessControl,type=Distributed, > GemFire:service=FileUploader,type=Distributed, > GemFire:service=System,type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one, > GemFire:service=Locator,type=Member,member=locator-one, > GemFire:type=Member,member=locator-one, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-two, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-two, > GemFire:service=Locator,type=Member,member=locator-two, > GemFire:type=Member,member=locator-two, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-2, > > GemFire:service=CacheServer,port=33929,type=Member,member=server-2, > GemFire:type=Member,member=server-2, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, > > GemFire:service=CacheServer,port=46497,type=Member,member=server-3, > GemFire:type=Member,member=server-3]> > to contain exactly (and in same order): > <[GemFire:service=Region,name="/test-region-1",type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed, > GemFire:service=AccessControl,type=Distributed, > GemFire:service=FileUploader,type=Distributed, > GemFire:service=System,type=Distributed, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one, > GemFire:service=Locator,type=Member,member=locator-one, > GemFire:type=Member,member=locator-one, > > GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-two, > > GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-two, > GemFire:service=Locator,type=Member,member=locator-two, > GemFire:type=Member,member=locator-two, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-2, > > GemFire:service=CacheServer,port=33929,type=Member,member=server-2, > GemFire:type=Member,member=server-2, > > GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, > > GemFire:service=CacheServer,port=46497,type=Member,member=server-3, > GemFire:type=Member,member=server-3]> > but some elements were not expected: > > <[GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one]> > at > org.apache.geode.management.JMXMBeanReconnectDUnitTest.lambda$before$0(JMXMBeanReconnectDUnitTest.java:106) > 8269 tests completed, 1 failed, 495 skipped > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)