[ https://issues.apache.org/jira/browse/GEODE-8267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17454164#comment-17454164 ]
Geode Integration commented on GEODE-8267: ------------------------------------------ Seen in [distributed-test-openjdk8 #330|https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-mass-test-run/jobs/distributed-test-openjdk8/builds/330] ... see [test results|http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.0705/test-results/distributedTest/1638604667/] or download [artifacts|http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.0705/test-artifacts/1638604667/distributedtestfiles-openjdk8-1.15.0-build.0705.tgz]. > serverRestartsAfterOneLocatorDies hangs > --------------------------------------- > > Key: GEODE-8267 > URL: https://issues.apache.org/jira/browse/GEODE-8267 > Project: Geode > Issue Type: Bug > Components: configuration, locator, membership > Reporter: Bill Burcham > Priority: Major > > hang: > [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK11/builds/275#A] > > The test hung in serverRestartsAfterOneLocatorDies after another failure in > the same test class. > Here's the hung thread: > {noformat} > "Test worker" #27 prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s > tid=0x00007f01c8ad4800 nid=0x18 runnable [0x00007f019872c000]"Test worker" > #27 prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s tid=0x00007f01c8ad4800 > nid=0x18 runnable [0x00007f019872c000] java.lang.Thread.State: RUNNABLE at > java.net.SocketInputStream.socketRead0(java.base@11.0.7/Native Method) at > java.net.SocketInputStream.socketRead(java.base@11.0.7/SocketInputStream.java:115) > at > java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:168) > at > java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:140) > at > java.io.BufferedInputStream.fill(java.base@11.0.7/BufferedInputStream.java:252) > at > java.io.BufferedInputStream.read(java.base@11.0.7/BufferedInputStream.java:271) > - locked <0x00000000d08fe7a0> (a java.io.BufferedInputStream) at > java.io.DataInputStream.readByte(java.base@11.0.7/DataInputStream.java:270) > at > sun.rmi.transport.StreamRemoteCall.executeCall(java.rmi@11.0.7/StreamRemoteCall.java:240) > at sun.rmi.server.UnicastRef.invoke(java.rmi@11.0.7/UnicastRef.java:164) at > java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:217) > at > java.rmi.server.RemoteObjectInvocationHandler.invoke(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:162) > at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source) at > org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607) at > org.apache.geode.test.dunit.VM.invoke(VM.java:450) at > org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:268) > at > org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:261) > at > org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:256) > at > org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartsAfterOneLocatorDies(ClusterConfigLocatorRestartDUnitTest.java:114) > at > jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.7/Native > Method) {noformat} > Here's the previous test failure, which may have affected the test that hung: > {code:java} > org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest > > serverRestartHangsWaitingForStartupMessageResponse FAILED > org.junit.runners.model.TestTimedOutException: test timed out after > 300000 milliseconds > at java.net.SocketInputStream.socketRead0(Native Method) > at java.net.SocketInputStream.socketRead(SocketInputStream.java:115) > at java.net.SocketInputStream.read(SocketInputStream.java:168) > at java.net.SocketInputStream.read(SocketInputStream.java:140) > at java.io.BufferedInputStream.fill(BufferedInputStream.java:252) > at java.io.BufferedInputStream.read(BufferedInputStream.java:271) > at java.io.DataInputStream.readByte(DataInputStream.java:270) > at > sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:240) > at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164) > at > java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217) > at > java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162) > at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source) > at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607) > at org.apache.geode.test.dunit.VM.invoke(VM.java:437) > at > org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:94) > at > org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartHangsWaitingForStartupMessageResponse(ClusterConfigLocatorRestartDUnitTest.java:176) > {code} > Seems like 300s should be long enough so I fear there may be a real problem > here. -- This message was sent by Atlassian Jira (v8.20.1#820001)