On 10 October 2017 at 12:13, Eric Garver <e...@erig.me> wrote:
> On Tue, Oct 10, 2017 at 10:24:20AM -0700, Joe Stringer wrote:
>> On 10 October 2017 at 08:09, Eric Garver <e...@erig.me> wrote:
>> > On Tue, Oct 10, 2017 at 05:33:48AM -0700, Joe Stringer wrote:
>> >> On 9 October 2017 at 21:41, Pravin Shelar <pshe...@ovn.org> wrote:
>> >> > On Fri, Oct 6, 2017 at 9:44 AM, Eric Garver <e...@erig.me> wrote:
>> >> >> This adds a ct_clear action for clearing conntrack state. ct_clear is
>> >> >> currently implemented in OVS userspace, but is not backed by an action
>> >> >> in the kernel datapath. This is useful for flows that may modify a
>> >> >> packet tuple after a ct lookup has already occurred.
>> >> >>
>> >> >> Signed-off-by: Eric Garver <e...@erig.me>
>> >> > Patch mostly looks good. I have following comments.
>> >> >
>> >> >> ---
>> >> >>  include/uapi/linux/openvswitch.h |  2 ++
>> >> >>  net/openvswitch/actions.c        |  5 +++++
>> >> >>  net/openvswitch/conntrack.c      | 12 ++++++++++++
>> >> >>  net/openvswitch/conntrack.h      |  7 +++++++
>> >> >>  net/openvswitch/flow_netlink.c   |  5 +++++
>> >> >>  5 files changed, 31 insertions(+)
>> >> >>
>> >> >> diff --git a/include/uapi/linux/openvswitch.h
>> >> >> b/include/uapi/linux/openvswitch.h
>> >> >> index 156ee4cab82e..1b6e510e2cc6 100644
>> >> >> --- a/include/uapi/linux/openvswitch.h
>> >> >> +++ b/include/uapi/linux/openvswitch.h
>> >> >> @@ -806,6 +806,7 @@ struct ovs_action_push_eth {
>> >> >>   * packet.
>> >> >>   * @OVS_ACTION_ATTR_POP_ETH: Pop the outermost Ethernet header off the
>> >> >>   * packet.
>> >> >> + * @OVS_ACTION_ATTR_CT_CLEAR: Clear conntrack state from the packet.
>> >> >>   *
>> >> >>   * Only a single header can be set with a single
>> >> >> %OVS_ACTION_ATTR_SET. Not all
>> >> >>   * fields within a header are modifiable, e.g. the IPv4 protocol and
>> >> >> fragment
>> >> >> @@ -835,6 +836,7 @@ enum ovs_action_attr {
>> >> >>     OVS_ACTION_ATTR_TRUNC, /* u32 struct ovs_action_trunc.
>> >> >> */
>> >> >>     OVS_ACTION_ATTR_PUSH_ETH, /* struct ovs_action_push_eth. */
>> >> >>     OVS_ACTION_ATTR_POP_ETH, /* No argument. */
>> >> >> +   OVS_ACTION_ATTR_CT_CLEAR, /* No argument. */
>> >> >>
>> >> >>     __OVS_ACTION_ATTR_MAX, /* Nothing past this will be
>> >> >> accepted
>> >> >>                             * from userspace. */
>> >> >> diff --git a/net/openvswitch/actions.c b/net/openvswitch/actions.c
>> >> >> index a54a556fcdb5..db9c7f2e662b 100644
>> >> >> --- a/net/openvswitch/actions.c
>> >> >> +++ b/net/openvswitch/actions.c
>> >> >> @@ -1203,6 +1203,10 @@ static int do_execute_actions(struct datapath
>> >> >> *dp, struct sk_buff *skb,
>> >> >>             return err == -EINPROGRESS ? 0 : err;
>> >> >>         break;
>> >> >>
>> >> >> +   case OVS_ACTION_ATTR_CT_CLEAR:
>> >> >> +       err = ovs_ct_clear(skb, key);
>> >> >> +       break;
>> >> >> +
>> >> >>     case OVS_ACTION_ATTR_PUSH_ETH:
>> >> >>         err = push_eth(skb, key, nla_data(a));
>> >> >>         break;
>> >> >> @@ -1210,6 +1214,7 @@ static int do_execute_actions(struct datapath
>> >> >> *dp, struct sk_buff *skb,
>> >> >>     case OVS_ACTION_ATTR_POP_ETH:
>> >> >>         err = pop_eth(skb, key);
>> >> >>         break;
>> >> >> +
>> >> >>     }
>> >> > Unrelated change.
>> >> >
>> >> >>
>> >> >>     if (unlikely(err)) {
>> >> >> diff --git a/net/openvswitch/conntrack.c b/net/openvswitch/conntrack.c
>> >> >> index d558e882ca0c..f9b73c726ad7 100644
>> >> >> --- a/net/openvswitch/conntrack.c
>> >> >> +++ b/net/openvswitch/conntrack.c
>> >> >> @@ -1129,6 +1129,18 @@ int ovs_ct_execute(struct net *net, struct
>> >> >> sk_buff *skb,
>> >> >>     return err;
>> >> >>  }
>> >> >>
>> >> >> +int ovs_ct_clear(struct sk_buff *skb, struct sw_flow_key *key)
>> >> >> +{
>> >> >> +   if (skb_nfct(skb)) {
>> >> >> +       nf_conntrack_put(skb_nfct(skb));
>> >> >> +       nf_ct_set(skb, NULL, 0);
>> >> > Can the new conntract state be appropriate? may be IP_CT_UNTRACKED?
>> >> >
>> >> >> +   }
>> >> >> +
>> >> >> +   ovs_ct_fill_key(skb, key);
>> >> >> +
>> >> > I do not see need to refill the key if there is no skb-nf-ct.
>> >>
>> >> Really this is trying to just zero the CT key fields, but reuses
>> >> existing functions, right? This means that subsequent upcalls, for
>> >
>> > Right.
>> >
>> >> instance, won't have the outdated view of the CT state from the
>> >> previous lookup (that was prior to the ct_clear). I'd expect these key
>> >> fields to be cleared.
>> >
>> > I assumed Pravin was saying that we don't need to clear them if there is
>> > no conntrack state. They should already be zero.
>>
>> The conntrack calls aren't going to clear it, so I don't see what else
>> would clear it?
>>
>> If you execute ct(),ct_clear(), then the first ct will set the
>> values.. what will zero them?
>
> I meant move ovs_ct_fill_key() to inside the if statement.
> i.e.
>
> if (skb_nfct(skb)) {
>     nf_conntrack_put(skb_nfct(skb));
>     nf_ct_set(skb, NULL, IP_CT_UNTRACKED);
>     ovs_ct_fill_key(skb, key);
> }
>
> Should be nothing to fill/zero if we have not yet done conntrack.
> Is there a case where we may lose skb->_nfct, but the key still has
> conntrack data?

Ah, misreading on my part. Right, if there is no nfct then it should already have the right value. I don't think there's a way to lose it and leave the key with conntrack data.
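
To spell out the reasoning, here is a rough userspace model of the variant with the key refill inside the if statement. This is only a sketch and not kernel code: the mock_* types and functions, the field names, and MOCK_CT_STATE_TRACKED are all made up for illustration and merely stand in for skb->_nfct, the CT fields of the flow key, ct(), ovs_ct_fill_key() and ovs_ct_clear(). It assumes, as discussed above, that the key only carries conntrack data while the packet holds a conntrack reference.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins for the kernel objects; not the real API. */
struct mock_conn { int refcnt; unsigned int zone; unsigned int mark; };
struct mock_skb  { struct mock_conn *nfct; };          /* models skb->_nfct */
struct mock_key  { unsigned int ct_state, ct_zone, ct_mark; };

#define MOCK_CT_STATE_TRACKED 0x1u

/* Models ovs_ct_fill_key(): mirror the packet's conntrack info into the
 * key, or zero the CT fields when the packet has no conntrack entry. */
static void mock_fill_key(const struct mock_skb *skb, struct mock_key *key)
{
    if (skb->nfct) {
        key->ct_state = MOCK_CT_STATE_TRACKED;
        key->ct_zone  = skb->nfct->zone;
        key->ct_mark  = skb->nfct->mark;
    } else {
        memset(key, 0, sizeof(*key));
    }
}

/* Models a ct() action: attach a connection and populate the key. */
static void mock_ct(struct mock_skb *skb, struct mock_key *key,
                    struct mock_conn *conn)
{
    conn->refcnt++;
    skb->nfct = conn;
    mock_fill_key(skb, key);
}

/* Models ct_clear with the refill moved inside the if statement. */
static void mock_ct_clear(struct mock_skb *skb, struct mock_key *key)
{
    if (skb->nfct) {
        assert(skb->nfct->refcnt > 0);
        skb->nfct->refcnt--;        /* models nf_conntrack_put() */
        skb->nfct = NULL;           /* models nf_ct_set(skb, NULL, ...) */
        mock_fill_key(skb, key);    /* zeroes the CT key fields */
    }
    /* No conntrack entry: the key was never filled, nothing to zero. */
}
```

In this model, calling mock_ct_clear() on a packet that never went through mock_ct() leaves the already-zero key untouched, while mock_ct() followed by mock_ct_clear() drops the reference and zeroes the key fields, which is the ct(),ct_clear() case raised earlier in the thread.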