Hi Julian,

On Sun, Jun 12, 2016, at 02:09, Julian Anastasov wrote:
> 
>       Hello,
> 
> On Sat, 11 Jun 2016, Hannes Frederic Sowa wrote:
> 
> > Users of ToS rules could accidentally break ECN, this patch tries to
> > fix this in a way so we don't break shell scripts depending on the old
> > behavior while still being transparent to ECN. This quietly fixes ECN
> > behavior for old setups.
> > 
> > For IPv6 we have no check if we check for ECN bits, in IPv4 we only
> > check for the last bit, which is specified to be '0' from pre-DSCP times
> > (because of implementation confusion).
> > 
> > This patch changes fib rules in a way that matches only for ecn bits
> > will never match from now on (I consider them illegal), as we simply
> > ignore those rules (it was easier to explain in a pr_warn). Opinions?
> 
>       Well, may be the confusion comes from commit 89aef8921bfb
> ("ipv4: Delete routing cache.") where the 'tos &= IPTOS_RT_MASK;'
> line is lost from ip_route_input_common. I think, we should
> add it back, so that we can properly match input routes with rules
> that specify tos value. Old kernels didn't stored ECN bits in
> flowi4_tos in the input path, so we should do the same.

I would love to have done that but was fearing problems with user space
compatibility. Also IPTOS_RT_MASK is not enough for filtering, we need
to check for the whole INET_ECN_MASK.

> 
> > Cc: Florian Westphal <f...@strlen.de>
> > Signed-off-by: Hannes Frederic Sowa <han...@stressinduktion.org>
> > ---
> >  include/net/inet_ecn.h | 5 +++++
> >  net/ipv4/fib_rules.c   | 6 +++++-
> >  net/ipv6/fib6_rules.c  | 7 ++++++-
> >  3 files changed, 16 insertions(+), 2 deletions(-)
> > 
> > diff --git a/include/net/inet_ecn.h b/include/net/inet_ecn.h
> > index 0dc0a51da38faa..be65d94c05ee01 100644
> > --- a/include/net/inet_ecn.h
> > +++ b/include/net/inet_ecn.h
> > @@ -17,6 +17,11 @@ enum {
> >  
> >  extern int sysctl_tunnel_ecn_log;
> >  
> > +static inline __u8 INET_ECN_ignore(__u8 dsfield)
> > +{
> > +   return dsfield & ~INET_ECN_MASK;
> > +}
> > +
> >  static inline int INET_ECN_is_ce(__u8 dsfield)
> >  {
> >     return (dsfield & INET_ECN_MASK) == INET_ECN_CE;
> > diff --git a/net/ipv4/fib_rules.c b/net/ipv4/fib_rules.c
> > index 6e9ea69e5f751b..87efbc38589c76 100644
> > --- a/net/ipv4/fib_rules.c
> > +++ b/net/ipv4/fib_rules.c
> > @@ -145,7 +145,8 @@ static int fib4_rule_match(struct fib_rule *rule, 
> > struct flowi *fl, int flags)
> >         ((daddr ^ r->dst) & r->dstmask))
> >             return 0;
> >  
> > -   if (r->tos && (r->tos != fl4->flowi4_tos))
> > +   if (!INET_ECN_ignore(r->tos) ||
> 
>       Above check will succeed for any rule that does not use
> tos (r->tos = 0). So, rules without tos will return 0 (no match).

Oh yes, certainly. I switched back and forth between different schemas.
This obviously doesn't work, thanks! I actually only tested the cases
with tos rules. :(

> > +       INET_ECN_ignore(r->tos) != INET_ECN_ignore(fl4->flowi4_tos))
> >             return 0;
> 
>       fib4_rule_configure already rejects ECN bits in r->tos,
> so no need to filter them again in fast path.

The problem is that IPTOS_TOS_MASK is just the masking of the 0x1 bit,
not both ECN bits. Thi stems from the pre-DSCP time where it was
forbidden to use bit 0x1 in TOS.

> >     return 1;
> > @@ -215,6 +216,9 @@ static int fib4_rule_configure(struct fib_rule *rule, 
> > struct sk_buff *skb,
> >     rule4->dst_len = frh->dst_len;
> >     rule4->dstmask = inet_make_mask(rule4->dst_len);
> >     rule4->tos = frh->tos;
> > +   if (!INET_ECN_ignore(rule4->tos))
> > +           pr_warn("ipv4: never matching ipv4 rule with match for %x 
> > added\n",
> > +                   rule4->tos);
> 
>       You should see this warning for non-tos rules...

I added the warnings after testing, sorry...

Bye,
Hannes

Reply via email to