Shilun Fan created RATIS-2501:
---------------------------------

             Summary: Improve diagnostics for 
testInstallSnapshotDuringBootstrap timeout failures
                 Key: RATIS-2501
                 URL: https://issues.apache.org/jira/browse/RATIS-2501
             Project: Ratis
          Issue Type: Sub-task
          Components: gRPC
            Reporter: Shilun Fan
            Assignee: Shilun Fan


The {{testInstallSnapshotDuringBootstrap}} test occasionally times out in CI 
with minimal diagnostic information:
{code:java}
java.util.concurrent.TimeoutException: testInstallSnapshotDuringBootstrap() 
timed out after 100 seconds
Suppressed: java.io.InterruptedIOException: retry 
policy=RetryForeverWithSleep(sleepTime = 100ms)
    at org.apache.ratis.client.impl.AdminImpl.setConfiguration
    at org.apache.ratis.server.impl.MiniRaftCluster.setConfiguration {code}
The current implementation uses unlimited retries for setConfiguration, which 
can cause the test to hang indefinitely until the overall test timeout occurs. 
When this happens, there's insufficient information in the logs to diagnose the 
root cause.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to