From: Florian Westphal <f...@strlen.de>
Date: Sat,  2 Dec 2017 21:44:04 +0100

> Peter Zijlstra reported (referring to commit 019a316992ee0d983,
> "rtnetlink: add reference counting to prevent module unload while dump is in 
> progress"):
> 
>  1) it not in fact a refcount, so using refcount_t is silly
>  2) there is a distinct lack of memory barriers, so we can easily
>     observe the decrement while the msg_handler is still in progress.
>  3) waiting with a schedule()/yield() loop is complete crap and subject
>     life-locks, imagine doing that rtnl_unregister_all() from a RT task.
> 
> In ancient times rtnetlink exposed a statically-sized table with
> preset doit/dumpit handlers to be called for a protocol/type pair.
> 
> Later the rtnl_register interface was added and the table was allocated
> on demand.  Eventually these were also used by modules.
> 
> Problem is that nothing prevents module unload while a netlink dump
> is in progress.  netlink dumps can be span multiple recv calls and
> netlink core saves the to-be-repeated dumper address for later invocation.
> 
> To prevent rmmod the netlink core expects callers to pass in the owning
> module so a reference can be taken.
> 
> So far rtnetlink wasn't doing this, add new interface to pass THIS_MODULE.
> Moreover, when converting parts of the rtnetlink handling to rcu this code
> gained way too many READ_ONCE spots, remove them and the extra refcounting.
> 
> Take a module reference when running dumpit and doit callbacks
> and never alter content of rtnl_link structures after they have been
> published via rcu_assign_pointer.
> 
> Based partially on earlier patch from Peter.

Series applied, thanks Florian.

BTW, why did you use rtnl_register_module() for most of ipv6 but
specifically not for the net/ipv6/addrconf.c stuff?

Just curious.

Reply via email to