Re: Geode retry/acknowledge improvement

Mario Ivanac Tue, 20 Apr 2021 07:31:36 -0700

Hi,

After analysis, we assume that the proxy, on receiving the packets, sends an ACK 
at the TCP level and is restarted only after that moment.
This is why we don't see TCP retries.

A similar problem (without packet loss) can be reproduced on Geode:
if a TCP reset is received on an existing connection after a request has been 
sent, the connection is closed on reception of the reset,
and the thread gets stuck waiting for the reply.
I will add reproduction steps to the ticket.
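
To make the stuck-thread scenario concrete, here is a minimal sketch of a reply 
wait that cannot hang forever. It is not Geode code: ReplyWaiter, complete(), 
fail() and demo() are hypothetical names used only for illustration. The idea 
is that the waiting thread uses a bounded wait, and whatever closes the 
connection on a reset also wakes up the waiter with an error.

```python
import threading


class ReplyWaiter:
    """Hypothetical helper: tracks one outstanding request on a connection."""

    def __init__(self):
        self._done = threading.Event()
        self._reply = None
        self._error = None

    def complete(self, reply):
        # Called by the reader thread when the reply arrives.
        self._reply = reply
        self._done.set()

    def fail(self, error):
        # Called when the connection is closed or reset, so the waiter wakes up
        # instead of blocking on a reply that can no longer arrive.
        self._error = error
        self._done.set()

    def wait(self, timeout_s):
        # Bounded wait: returns the reply, or raises on failure or timeout.
        if not self._done.wait(timeout_s):
            raise TimeoutError("no reply within %.1f s" % timeout_s)
        if self._error is not None:
            raise self._error
        return self._reply


def demo():
    waiter = ReplyWaiter()
    # Simulate a TCP reset arriving shortly after the request was sent.
    reset = ConnectionResetError("connection reset by peer")
    threading.Timer(0.05, waiter.fail, args=(reset,)).start()
    try:
        waiter.wait(timeout_s=2.0)
    except ConnectionResetError:
        return "reset"
    except TimeoutError:
        return "timeout"
    return "reply"
```

With this shape, a reset surfaces as an error in the waiting thread, and even if 
the close notification is missed, the timeout bounds how long the thread waits.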

________________________________
From: Anthony Baker <[email protected]>
Sent: 19 April 2021 22:54
To: [email protected] <[email protected]>
Subject: Re: Geode retry/acknowledge improvement

Do you have a tcpdump that demonstrates the packet loss? How long did you wait 
for TCP to retry the failed packet delivery (sometimes this can be tweaked with 
tcp_retries2)? Does this manifest as a failed socket connection in Geode? 
That ought to trigger some error handling IIRC.
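
As a rough guide to how long TCP keeps retrying for a given tcp_retries2 value 
on Linux, the sketch below sums the exponentially backed-off retransmission 
timeouts. It assumes the kernel's minimum RTO of 200 ms and maximum RTO of 
120 s; the real RTO depends on the measured round-trip time, so treat the 
result as a lower-bound estimate. The function name is made up for this example.

```python
def approx_retransmit_timeout(retries, rto_min=0.2, rto_max=120.0):
    """Approximate seconds before TCP gives up on an unacknowledged segment.

    Assumes the first timeout equals rto_min and each retry doubles it,
    capped at rto_max. The initial timeout plus `retries` retransmissions
    give retries + 1 waiting periods in total.
    """
    total = 0.0
    for attempt in range(retries + 1):
        total += min(rto_min * (2 ** attempt), rto_max)
    return total


# The Linux default tcp_retries2 = 15 gives about 924.6 seconds.
default_timeout = approx_retransmit_timeout(15)
```

So with default settings, a lost segment on an otherwise idle connection can 
take around 15 minutes to surface as a socket error.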

Anthony

> On Apr 19, 2021, at 7:16 AM, Mario Ivanac <[email protected]> wrote:
>
> Hi all,
>
> we have deployed a Geode cluster in a Kubernetes environment, and Istio 
> sidecars are injected between cluster members.
> While running traffic, if any Istio sidecar is restarted, a thread will get 
> stuck indefinitely while waiting for a reply to a sent message.
> It seems that, due to the proxy restart, messages are lost in some cases, 
> and the sending side waits indefinitely for a reply.
>
> https://issues.apache.org/jira/browse/GEODE-9075
>
> My question is: what is your estimate of how much effort is needed to 
> implement message retry/acknowledge logic in Geode
> to solve this problem?
>
> BR,
> Mario
