From: Paolo Abeni <pab...@redhat.com>

Otherwise we will find stray/unexpected/old extension values on the next
iteration.

In tcp_write_xmit() we can end up splitting an already queued skb in two
parts via tso_fragment(). The newly created skb can be allocated from the
tx cache, and the MPTCP stack will not be aware of it, so nobody properly
sets the MPTCP ext.

The end result is that we transmit the skb using an old MPTCP DSS map,
which confuses the rx side and corrupts the stream. This requires some
concurrent conditions, so it is not deterministic.

Resetting the ext on recycle fixes all the current MPTCP self-test issues.

Apparently only MPTCP has issues with this kind of stray ext, so an
alternative would be to add an additional MPTCP hook in tso_fragment() or
in sk_stream_alloc_skb() to always init the ext.

Signed-off-by: Paolo Abeni <pab...@redhat.com>
---
 include/linux/skbuff.h | 8 ++++++++
 include/net/sock.h     | 1 +
 2 files changed, 9 insertions(+)

diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h
index 28bdaf978e72..37387ab9f336 100644
--- a/include/linux/skbuff.h
+++ b/include/linux/skbuff.h
@@ -4024,6 +4024,14 @@ static inline void skb_ext_put(struct sk_buff *skb)
 		__skb_ext_put(skb->extensions);
 }
 
+static inline void skb_ext_clear(struct sk_buff *skb)
+{
+	if (skb->active_extensions) {
+		__skb_ext_put(skb->extensions);
+		skb->active_extensions = 0;
+	}
+}
+
 static inline void __skb_ext_copy(struct sk_buff *dst,
 				  const struct sk_buff *src)
 {
diff --git a/include/net/sock.h b/include/net/sock.h
index e9d769c04637..bfa695716721 100644
--- a/include/net/sock.h
+++ b/include/net/sock.h
@@ -1469,6 +1469,7 @@ static inline void sk_wmem_free_skb(struct sock *sk, struct sk_buff *skb)
 	sk->sk_wmem_queued -= skb->truesize;
 	sk_mem_uncharge(sk, skb->truesize);
 	if (!sk->sk_tx_skb_cache && !skb_cloned(skb)) {
+		skb_ext_clear(skb);
 		skb_zcopy_clear(skb, true);
 		sk->sk_tx_skb_cache = skb;
 		return;
-- 
2.22.0
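
For readers who want to see the failure mode in isolation, the recycle
problem described in the commit message can be modelled in userspace. This
is only a rough sketch, not kernel code: every toy_* type and function is a
hypothetical stand-in invented for illustration, the cache is a single
slot, and the "DSS map" is just an integer. It shows that an skb parked in
the cache with its extension still attached comes back from the next
allocation carrying that stale extension, and that clearing on recycle
avoids it.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins for the kernel structures, not the real layouts. */
struct toy_ext {
	int refcnt;
	int dss_map;			/* stands in for the MPTCP DSS mapping */
};

struct toy_skb {
	unsigned char active_extensions;
	struct toy_ext *extensions;
};

struct toy_sock {
	struct toy_skb *tx_skb_cache;	/* one-slot tx cache */
};

static void toy_ext_put(struct toy_ext *ext)
{
	if (ext && --ext->refcnt == 0)
		free(ext);
}

/* Same shape as skb_ext_clear() in the patch: drop the ref, clear the flag. */
static void toy_ext_clear(struct toy_skb *skb)
{
	if (skb->active_extensions) {
		toy_ext_put(skb->extensions);
		skb->active_extensions = 0;
	}
}

/* Park the skb in the cache if the slot is empty, else really free it. */
static void toy_free_skb(struct toy_sock *sk, struct toy_skb *skb, int clear_ext)
{
	if (!sk->tx_skb_cache) {
		if (clear_ext)
			toy_ext_clear(skb);	/* the fix: reset ext on recycle */
		sk->tx_skb_cache = skb;
		return;
	}
	toy_ext_clear(skb);
	free(skb);
}

/* Prefer the cached skb; like the split path, nobody re-initializes the ext. */
static struct toy_skb *toy_alloc_skb(struct toy_sock *sk)
{
	struct toy_skb *skb = sk->tx_skb_cache;

	if (skb) {
		sk->tx_skb_cache = NULL;
		return skb;
	}
	skb = calloc(1, sizeof(*skb));
	assert(skb);
	return skb;
}

/* Returns 1 if the recycled skb still carries the old extension, else 0. */
static int toy_recycled_has_stale_ext(int clear_ext)
{
	struct toy_sock sk = { 0 };
	struct toy_skb *skb = toy_alloc_skb(&sk);
	struct toy_ext *ext = calloc(1, sizeof(*ext));
	int stale;

	assert(ext);
	ext->refcnt = 1;
	ext->dss_map = 1234;
	skb->extensions = ext;
	skb->active_extensions = 1;

	toy_free_skb(&sk, skb, clear_ext);	/* skb goes into the cache */
	skb = toy_alloc_skb(&sk);		/* models the alloc done for a split */
	stale = skb->active_extensions != 0;

	toy_ext_clear(skb);			/* tidy up whatever is left */
	free(skb);
	return stale;
}
```

Calling toy_recycled_has_stale_ext(0) models the behaviour before the
patch, and toy_recycled_has_stale_ext(1) models it with the ext reset on
recycle.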