From: Paolo Abeni <pab...@redhat.com> Otherwise we will find stray/unexpected/old extensions value on next iteration.
On tcp_write_xmit() we can end-up splitting an already queued skb in two parts, via tso_fragment(). The newly created skb can be allocated via the tx cache and the mptcp stack will not be aware of it, so nobody set properly the MPTCP ext. End result, we transmit the skb using an hold MPTCP DSS map and that confuses the rx side/corrupt the stream. It requires some concurrent conditions, so it's not deterministic. Resetting the ext on recycle fixes all the current mptcp self tests issues. Apparently only MPTCP has issues with this kind of stray ext, so an alternative would be add an additional mptcp hook in tso_fragment() or in sk_stream_alloc_skb() to always init the ext. Signed-off-by: Paolo Abeni <pab...@redhat.com> --- include/linux/skbuff.h | 8 ++++++++ include/net/sock.h | 1 + 2 files changed, 9 insertions(+) diff --git a/include/linux/skbuff.h b/include/linux/skbuff.h index e7d3b1a513ef..e7a7abd62026 100644 --- a/include/linux/skbuff.h +++ b/include/linux/skbuff.h @@ -4099,6 +4099,14 @@ static inline void skb_ext_put(struct sk_buff *skb) __skb_ext_put(skb->extensions); } +static inline void skb_ext_clear(struct sk_buff *skb) +{ + if (skb->active_extensions) { + __skb_ext_put(skb->extensions); + skb->active_extensions = 0; + } +} + static inline void __skb_ext_copy(struct sk_buff *dst, const struct sk_buff *src) { diff --git a/include/net/sock.h b/include/net/sock.h index ca2071555dde..b9a085d0bb18 100644 --- a/include/net/sock.h +++ b/include/net/sock.h @@ -1470,6 +1470,7 @@ static inline void sk_wmem_free_skb(struct sock *sk, struct sk_buff *skb) sk_mem_uncharge(sk, skb->truesize); if (static_branch_unlikely(&tcp_tx_skb_cache_key) && !sk->sk_tx_skb_cache && !skb_cloned(skb)) { + skb_ext_clear(skb); skb_zcopy_clear(skb, true); sk->sk_tx_skb_cache = skb; return; -- 2.23.0