[ 
https://issues.apache.org/jira/browse/GEODE-7796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17039567#comment-17039567
 ] 

ASF subversion and git services commented on GEODE-7796:
--------------------------------------------------------

Commit 290df3a182c203c17e13bc7bc3cc8701ba326fc1 in geode's branch 
refs/heads/release/1.12.0 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=290df3a ]

Feature/geode 7796: CI hang in LocatorDUnitTest.testCrashLocatorMultipleTimes 
(#4706)

* GEODE-7796: CI failure in LocatorDUnitTest

The new test hung during a CI run due to a flaw in Geode.  I'm disabling
it until the flaw is corrected.

* reduce locator-wait-time (seconds, not millis)

also several other changes to fix the underlying failure:
  1) add a synchronization to the services restart thread so only one
  thread is active at a time
  2) shut down membership cleanup executor and avoid creating multiple
  TcpServer executors on auto-reconnect
  3) remove setting the locator as a dependent of the
  InternalDistributedSystem.  This was causing locator.stop() to be
  invoked multiple times (as Dale noticed in his analysis)
  4) when stopping a locator for auto-reconnect wait for it to stop in
  order to avoid creating multiple restart threads when there are
  cascading failures.
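
Points 1, 3 and 4 above can be illustrated with a minimal Java sketch. It is not the code from this commit: the class RestartGuard and all of its methods are hypothetical names invented for illustration, and it only assumes the general idea described above (one restart thread at a time, a stop that takes effect once, and waiting for the previous restart to finish before starting another).

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch only; these are not Geode's actual classes or method names. */
final class RestartGuard {
    private final Object restartLock = new Object();
    private Thread restartThread; // guarded by restartLock
    private final AtomicBoolean stopped = new AtomicBoolean(false);
    private final AtomicInteger stopCount = new AtomicInteger();

    /** Starts a restart thread unless one is still running. Returns true if a thread was started. */
    boolean launchRestartThread(Runnable restartWork) {
        synchronized (restartLock) {
            if (restartThread != null && restartThread.isAlive()) {
                return false; // a restart is already in progress, do not stack another one
            }
            restartThread = new Thread(restartWork, "restart-sketch");
            restartThread.setDaemon(true);
            restartThread.start();
            return true;
        }
    }

    /** Idempotent stop: the shutdown work runs at most once, however many callers invoke it. */
    void stop() {
        if (stopped.compareAndSet(false, true)) {
            stopCount.incrementAndGet(); // stands in for the real shutdown work
        }
    }

    /** Waits up to timeoutMillis for the current restart thread to finish. Returns true if none is alive afterwards. */
    boolean awaitRestart(long timeoutMillis) {
        Thread t;
        synchronized (restartLock) {
            t = restartThread;
        }
        if (t == null) {
            return true;
        }
        try {
            t.join(timeoutMillis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        return !t.isAlive();
    }

    int stopCount() {
        return stopCount.get();
    }
}
```

In this sketch a second call to launchRestartThread returns false while the first thread is still alive, and repeated calls to stop() perform the shutdown work only once. A caller handling cascading failures would call awaitRestart before attempting another restart.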

(cherry picked from commit 71fafc83844d3c13a228c705d32df374e5630651)


> CI: org.apache.geode.distributed.LocatorDUnitTest 
> testCrashLocatorMultipleTimes hung
> ------------------------------------------------------------------------------------
>
>                 Key: GEODE-7796
>                 URL: https://issues.apache.org/jira/browse/GEODE-7796
>             Project: Geode
>          Issue Type: Bug
>          Components: membership
>            Reporter: Jinmei Liao
>            Priority: Major
>             Fix For: 1.13.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/1563
> in the artifacts callbacks/dunit-hang.txt, 
> Started @ 2020-02-11 00:30:32.499 +0000
> 2020-02-11 01:07:59.054 +0000  org.apache.geode.distributed.LocatorDUnitTest 
> testCrashLocatorMultipleTimes
> Ended @ 2020-02-11 02:05:31.891 +0000
> and the stack traces show a thread that stays blocked for a long time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
