Alyssa Huang created KAFKA-18345:
------------------------------------
Summary: Investigate if binaryExponentialElectionBackoffMs can be
removed or improved
Key: KAFKA-18345
URL: https://issues.apache.org/jira/browse/KAFKA-18345
Project: Kafka
Issue Type: Improvement
Reporter: Alyssa Huang
Right now this exponential backoff method can return a min backoff of 200ms and
cap out at the electionBackoffMaxMs (1s default). The logic for calculating
this exponential backoff isn't very straightforward, and probably no longer
needed now that Candidate transitions to Prospective after election loss. We
can simplify states by not tracking "retries" as well.
We _do_ still need a bit of randomness to ensure we can get past gridlocked
elections, but as Diego's Raft thesis shows, 50ms of random range is sufficient
to avoid repeat split votes in elections.
> Using more randomness improves worst-case behavior: with a 50 ms random
> range, the worst-case completion time (over 1,000 trials) was 513 ms.
We should investigate if it would make sense to replace the exponential backoff
with a non-exponential backoff that is more straightforward.
e.g.
{code:java}
int randomElectionBackoffMs() {
long backoff = random(0, 50ms)
return Math.min(state.remainingElectionTimeoutMs, backoff)
}{code}
Note, it's fine to return state.remainingElectionTimeoutMs because that also
has randomness in it.
The alternative could also be using a lower default electionTimeoutMs (like
300ms mentioned in the Raft paper) and on election failure, waiting the extent
of state.remainingElectionTimeoutMs (which contains some jitter).
--
This message was sent by Atlassian Jira
(v8.20.10#820010)