[ 
https://issues.apache.org/jira/browse/GEODE-8267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bruce J Schuchardt updated GEODE-8267:
--------------------------------------
    Description: 
hang: 
[https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK11/builds/275#A]

 

The test hung in serverRestartsAfterOneLocatorDies after another failure in the 
same test class.

Here's the hung thread:
{noformat}
"Test worker" #27 prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s 
tid=0x00007f01c8ad4800 nid=0x18 runnable  [0x00007f019872c000]"Test worker" #27 
prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s tid=0x00007f01c8ad4800 nid=0x18 
runnable  [0x00007f019872c000]   java.lang.Thread.State: RUNNABLE at 
java.net.SocketInputStream.socketRead0(java.base@11.0.7/Native Method) at 
java.net.SocketInputStream.socketRead(java.base@11.0.7/SocketInputStream.java:115)
 at 
java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:168) at 
java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:140) at 
java.io.BufferedInputStream.fill(java.base@11.0.7/BufferedInputStream.java:252) 
at 
java.io.BufferedInputStream.read(java.base@11.0.7/BufferedInputStream.java:271) 
- locked <0x00000000d08fe7a0> (a java.io.BufferedInputStream) at 
java.io.DataInputStream.readByte(java.base@11.0.7/DataInputStream.java:270) at 
sun.rmi.transport.StreamRemoteCall.executeCall(java.rmi@11.0.7/StreamRemoteCall.java:240)
 at sun.rmi.server.UnicastRef.invoke(java.rmi@11.0.7/UnicastRef.java:164) at 
java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:217)
 at 
java.rmi.server.RemoteObjectInvocationHandler.invoke(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:162)
 at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source) at 
org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607) at 
org.apache.geode.test.dunit.VM.invoke(VM.java:450) at 
org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:268)
 at 
org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:261)
 at 
org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:256)
 at 
org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartsAfterOneLocatorDies(ClusterConfigLocatorRestartDUnitTest.java:114)
 at 
jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.7/Native 
Method) {noformat}
Here's the previous test failure, which may have affected the test that hung:
{code:java}
org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
 > serverRestartHangsWaitingForStartupMessageResponse FAILED

    org.junit.runners.model.TestTimedOutException: test timed out after 300000 
milliseconds

        at java.net.SocketInputStream.socketRead0(Native Method)

        at java.net.SocketInputStream.socketRead(SocketInputStream.java:115)

        at java.net.SocketInputStream.read(SocketInputStream.java:168)

        at java.net.SocketInputStream.read(SocketInputStream.java:140)

        at java.io.BufferedInputStream.fill(BufferedInputStream.java:252)

        at java.io.BufferedInputStream.read(BufferedInputStream.java:271)

        at java.io.DataInputStream.readByte(DataInputStream.java:270)

        at 
sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:240)

        at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)

        at 
java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)

        at 
java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)

        at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source)

        at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607)

        at org.apache.geode.test.dunit.VM.invoke(VM.java:437)

        at 
org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:94)

        at 
org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartHangsWaitingForStartupMessageResponse(ClusterConfigLocatorRestartDUnitTest.java:176)
{code}
Seems like 300s should be long enough so I fear there may be a real problem 
here.

  was:
hang: 
https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK11/builds/275#A

{code}
org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
 > serverRestartHangsWaitingForStartupMessageResponse FAILED

    org.junit.runners.model.TestTimedOutException: test timed out after 300000 
milliseconds

        at java.net.SocketInputStream.socketRead0(Native Method)

        at java.net.SocketInputStream.socketRead(SocketInputStream.java:115)

        at java.net.SocketInputStream.read(SocketInputStream.java:168)

        at java.net.SocketInputStream.read(SocketInputStream.java:140)

        at java.io.BufferedInputStream.fill(BufferedInputStream.java:252)

        at java.io.BufferedInputStream.read(BufferedInputStream.java:271)

        at java.io.DataInputStream.readByte(DataInputStream.java:270)

        at 
sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:240)

        at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)

        at 
java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)

        at 
java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)

        at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source)

        at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607)

        at org.apache.geode.test.dunit.VM.invoke(VM.java:437)

        at 
org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:94)

        at 
org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartHangsWaitingForStartupMessageResponse(ClusterConfigLocatorRestartDUnitTest.java:176)
{code}

Seems like 300s should be long enough so I fear there may be a real problem 
here.


> serverRestartsAfterOneLocatorDies hangs
> ---------------------------------------
>
>                 Key: GEODE-8267
>                 URL: https://issues.apache.org/jira/browse/GEODE-8267
>             Project: Geode
>          Issue Type: Bug
>          Components: configuration, locator, membership
>            Reporter: Bill Burcham
>            Priority: Major
>
> hang: 
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK11/builds/275#A]
>  
> The test hung in serverRestartsAfterOneLocatorDies after another failure in 
> the same test class.
> Here's the hung thread:
> {noformat}
> "Test worker" #27 prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s 
> tid=0x00007f01c8ad4800 nid=0x18 runnable  [0x00007f019872c000]"Test worker" 
> #27 prio=5 os_prio=0 cpu=5016.73ms elapsed=5638.52s tid=0x00007f01c8ad4800 
> nid=0x18 runnable  [0x00007f019872c000]   java.lang.Thread.State: RUNNABLE at 
> java.net.SocketInputStream.socketRead0(java.base@11.0.7/Native Method) at 
> java.net.SocketInputStream.socketRead(java.base@11.0.7/SocketInputStream.java:115)
>  at 
> java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:168) 
> at 
> java.net.SocketInputStream.read(java.base@11.0.7/SocketInputStream.java:140) 
> at 
> java.io.BufferedInputStream.fill(java.base@11.0.7/BufferedInputStream.java:252)
>  at 
> java.io.BufferedInputStream.read(java.base@11.0.7/BufferedInputStream.java:271)
>  - locked <0x00000000d08fe7a0> (a java.io.BufferedInputStream) at 
> java.io.DataInputStream.readByte(java.base@11.0.7/DataInputStream.java:270) 
> at 
> sun.rmi.transport.StreamRemoteCall.executeCall(java.rmi@11.0.7/StreamRemoteCall.java:240)
>  at sun.rmi.server.UnicastRef.invoke(java.rmi@11.0.7/UnicastRef.java:164) at 
> java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:217)
>  at 
> java.rmi.server.RemoteObjectInvocationHandler.invoke(java.rmi@11.0.7/RemoteObjectInvocationHandler.java:162)
>  at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source) at 
> org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607) at 
> org.apache.geode.test.dunit.VM.invoke(VM.java:450) at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:268)
>  at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:261)
>  at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:256)
>  at 
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartsAfterOneLocatorDies(ClusterConfigLocatorRestartDUnitTest.java:114)
>  at 
> jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.7/Native 
> Method) {noformat}
> Here's the previous test failure, which may have affected the test that hung:
> {code:java}
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest
>  > serverRestartHangsWaitingForStartupMessageResponse FAILED
>     org.junit.runners.model.TestTimedOutException: test timed out after 
> 300000 milliseconds
>         at java.net.SocketInputStream.socketRead0(Native Method)
>         at java.net.SocketInputStream.socketRead(SocketInputStream.java:115)
>         at java.net.SocketInputStream.read(SocketInputStream.java:168)
>         at java.net.SocketInputStream.read(SocketInputStream.java:140)
>         at java.io.BufferedInputStream.fill(BufferedInputStream.java:252)
>         at java.io.BufferedInputStream.read(BufferedInputStream.java:271)
>         at java.io.DataInputStream.readByte(DataInputStream.java:270)
>         at 
> sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:240)
>         at sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)
>         at 
> java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)
>         at 
> java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)
>         at com.sun.proxy.$Proxy53.executeMethodOnObject(Unknown Source)
>         at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:607)
>         at org.apache.geode.test.dunit.VM.invoke(VM.java:437)
>         at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:94)
>         at 
> org.apache.geode.management.internal.configuration.ClusterConfigLocatorRestartDUnitTest.serverRestartHangsWaitingForStartupMessageResponse(ClusterConfigLocatorRestartDUnitTest.java:176)
> {code}
> Seems like 300s should be long enough so I fear there may be a real problem 
> here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to