Shilun Fan created RATIS-2501:
---------------------------------
Summary: Improve diagnostics for
testInstallSnapshotDuringBootstrap timeout failures
Key: RATIS-2501
URL: https://issues.apache.org/jira/browse/RATIS-2501
Project: Ratis
Issue Type: Sub-task
Components: gRPC
Reporter: Shilun Fan
Assignee: Shilun Fan
The {{testInstallSnapshotDuringBootstrap}} test occasionally times out in CI
with minimal diagnostic information:
{code:java}
java.util.concurrent.TimeoutException: testInstallSnapshotDuringBootstrap()
timed out after 100 seconds
Suppressed: java.io.InterruptedIOException: retry
policy=RetryForeverWithSleep(sleepTime = 100ms)
at org.apache.ratis.client.impl.AdminImpl.setConfiguration
at org.apache.ratis.server.impl.MiniRaftCluster.setConfiguration {code}
The current implementation uses unlimited retries for setConfiguration, which
can cause the test to hang indefinitely until the overall test timeout occurs.
When this happens, there's insufficient information in the logs to diagnose the
root cause.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)