From: Steffen Klassert <steffen.klass...@secunet.com> Date: Fri, 15 Jan 2021 10:27:52 +0100
> On Fri, Jan 15, 2021 at 05:55:22PM +0900, Dongseok Yi wrote: >> On 2021-01-15 17:12, Steffen Klassert wrote: >>> On Fri, Jan 15, 2021 at 02:58:24PM +0900, Dongseok Yi wrote: >>>> UDP/IP header of UDP GROed frag_skbs are not updated even after NAT >>>> forwarding. Only the header of head_skb from ip_finish_output_gso -> >>>> skb_gso_segment is updated but following frag_skbs are not updated. >>>> >>>> A call path skb_mac_gso_segment -> inet_gso_segment -> >>>> udp4_ufo_fragment -> __udp_gso_segment -> __udp_gso_segment_list >>>> does not try to update UDP/IP header of the segment list. >>> >>> We still need to find out why it works for Alexander, but not for you. >>> Different usecases? >> >> This patch is not for >> https://lore.kernel.org/patchwork/patch/1364544/ >> Alexander might want to call udp_gro_receive_segment even when >> !sk and ~NETIF_F_GRO_FRAGLIST. > > Yes, I know. But he said that fraglist GRO + NAT works for him. > I want to find out why it works for him, but not for you. I found that it worked for me because I advertised fraglist GSO support in my driver (and added actual support for xmitting fraglists). If so, kernel won't resegment GSO into a list of plain packets, so no __udp_gso_segment_list() will be called. I think it will break if I disable fraglist GSO feature through Ethtool, so I could test your patches. >>> >>> I would not like to add this to a generic codepath. I think we can >>> relatively easy copy the full headers in skb_segment_list(). >> >> I tried to copy the full headers with the similar approach, but it >> copies length too. Can we keep the length of each skb of the fraglist? > > Ah yes, good point. > > Then maybe you can move your approach into __udp_gso_segment_list() > so that we dont touch generic code. > >> >>> >>> I think about something like the (completely untested) patch below: >>> >>> diff --git a/net/core/skbuff.c b/net/core/skbuff.c >>> index f62cae3f75d8..63ae7f79fad7 100644 >>> --- a/net/core/skbuff.c >>> +++ b/net/core/skbuff.c >>> @@ -3651,13 +3651,14 @@ struct sk_buff *skb_segment_list(struct sk_buff >>> *skb, >>> unsigned int offset) >>> { >>> struct sk_buff *list_skb = skb_shinfo(skb)->frag_list; >>> + unsigned int doffset = skb->data - skb_mac_header(skb); >>> unsigned int tnl_hlen = skb_tnl_header_len(skb); >>> unsigned int delta_truesize = 0; >>> unsigned int delta_len = 0; >>> struct sk_buff *tail = NULL; >>> struct sk_buff *nskb; >>> >>> - skb_push(skb, -skb_network_offset(skb) + offset); >>> + skb_push(skb, doffset); >>> >>> skb_shinfo(skb)->frag_list = NULL; >>> >>> @@ -3675,7 +3676,7 @@ struct sk_buff *skb_segment_list(struct sk_buff *skb, >>> delta_len += nskb->len; >>> delta_truesize += nskb->truesize; >>> >>> - skb_push(nskb, -skb_network_offset(nskb) + offset); >>> + skb_push(nskb, doffset); >>> >>> skb_release_head_state(nskb); >>> __copy_skb_header(nskb, skb); >>> diff --git a/net/ipv4/udp_offload.c b/net/ipv4/udp_offload.c >>> index ff39e94781bf..1181398378b8 100644 >>> --- a/net/ipv4/udp_offload.c >>> +++ b/net/ipv4/udp_offload.c >>> @@ -190,9 +190,22 @@ EXPORT_SYMBOL(skb_udp_tunnel_segment); >>> static struct sk_buff *__udp_gso_segment_list(struct sk_buff *skb, >>> netdev_features_t features) >>> { >>> + struct sk_buff *list_skb = skb_shinfo(skb)->frag_list; >>> unsigned int mss = skb_shinfo(skb)->gso_size; >>> + unsigned int offset; >>> >>> - skb = skb_segment_list(skb, features, skb_mac_header_len(skb)); >>> + skb_headers_offset_update(list_skb, skb_headroom(list_skb) - >>> skb_headroom(skb)); >>> + >>> + /* Check for header changes and copy the full header in that case. */ >>> + if ((udp_hdr(skb)->dest == udp_hdr(list_skb)->dest) && >>> + (udp_hdr(skb)->source == udp_hdr(list_skb)->source) && >>> + (ip_hdr(skb)->daddr == ip_hdr(list_skb)->daddr) && >>> + (ip_hdr(skb)->saddr == ip_hdr(list_skb)->saddr)) >>> + offset = skb_mac_header_len(skb); >>> + else >>> + offset = skb->data - skb_mac_header(skb); >>> + >>> + skb = skb_segment_list(skb, features, offset); >>> if (IS_ERR(skb)) >>> return skb; >>> >>> >>> After that you can apply the CSUM magic in __udp_gso_segment_list(). I'll test and let you know if it works. If doesn't, I think I'll be able to get a working one based on this. >> Sorry, I don't know CSUM magic well. Is it used for checksum >> incremental update too? > > With that I meant the checksum updating you did in your patch. Thanks, Al