Re: [Patch net] net: saving irq context for peernet2id()

Cong Wang Thu, 20 Oct 2016 16:36:46 -0700

On Thu, Oct 20, 2016 at 12:07 PM, Paul Moore <p...@paul-moore.com> wrote:
> On Thu, Oct 20, 2016 at 2:29 PM, Cong Wang <xiyou.wangc...@gmail.com> wrote:
>> On Thu, Oct 20, 2016 at 7:58 AM, Stephen Smalley <s...@tycho.nsa.gov> wrote:
>>> On 10/20/2016 02:52 AM, Cong Wang wrote:
>>>> A kernel warning inside __local_bh_enable_ip() was reported by people
>>>> running SELinux, this is caused due to some SELinux functions
>>>> (indirectly) call peernet2id() with IRQ disabled in process context,
>>>> when we re-enable BH with IRQ disabled kernel complains. Shut up this
>>>> warning by saving IRQ context in peernet2id(), BH is still implicitly
>>>> disabled.
>>>
>>> Not sure this suffices; kill_fasync() -> send_sigio() ->
>>> send_sigio_to_task() -> sigio_perm() -> security_file_send_sigiotask()
>>> -> selinux_file_send_sigiotask() -> ... -> audit_log() -> ... ->
>>> peernet2id()
>>
>> Oh, this is a new one. kill_fasync() is called in IRQ handler, so we actually
>> do multicast in IRQ context.... It makes no sense, netlink multicast could
>> be very expensive if we have many listeners.
>
> I'm sure there are a few others I don't know about, but I believe the
> only commonly used audit multicast listener is systemd.


But user-space is free to listen to this group, right? If so this is just open
for a potential DDOS attack.

>
>> I am Cc'ing Richard who added that multicast in audit_log_end(). It seems
>> not easy to just move the multicast to a workqueue, since the skb is copied
>> from audit_buffer which is freed immediately after that, probably need 
>> another
>> queue like audit_skb_queue.
>
> This approach would double the queue size which is something I want to
> avoid.  I would suggest sticking with a single queue and dealing with
> the netlink message link fixup and multicast send in the existing
> netlink unicast thread; basically we would just be moving the
> multicast code from audit_log_end() into kauditd_thread().  This is
> the same approach I mentioned earlier off-list.

This is what I did in the follow up patch. I attach the updated version
in this email for you to review, I still can't make selinux-testsuites working
on my Fedora even though I have SELinux=enforcing, anyhow I don't
see any kernel warning in my dmesg at least.

>
> However, that isn't something I want to mess with as a regression fix,
> mostly because I really want to see this regression gone by -rc2 as it
> is making SELinux testing a real pain.  If the patch posted at the top
> of this thread isn't a suitable fix, we really should revert the
> original patch.

Since you want to test SELinux anyway, please test the attached one.

Thanks.

diff --git a/kernel/audit.c b/kernel/audit.c
index f1ca116..cdc5a91 100644
--- a/kernel/audit.c
+++ b/kernel/audit.c
@@ -139,6 +139,7 @@ static int     audit_freelist_count;
 static LIST_HEAD(audit_freelist);
 
 static struct sk_buff_head audit_skb_queue;
+static struct sk_buff_head audit_skb_multicast_queue;
 /* queue of skbs to send to auditd when/if it comes back */
 static struct sk_buff_head audit_skb_hold_queue;
 static struct task_struct *kauditd_task;
@@ -468,7 +469,8 @@ static void kauditd_send_multicast_skb(struct sk_buff *skb, 
gfp_t gfp_mask)
        if (!copy)
                return;
 
-       nlmsg_multicast(sock, copy, 0, AUDIT_NLGRP_READLOG, gfp_mask);
+       skb_queue_tail(&audit_skb_multicast_queue, copy);
+       wake_up_interruptible(&kauditd_wait);
 }
 
 /*
@@ -509,6 +511,26 @@ static void flush_hold_queue(void)
        consume_skb(skb);
 }
 
+static void flush_multicast_queue(void)
+{
+       struct audit_net *aunet = net_generic(&init_net, audit_net_id);
+       struct sock *sock = aunet->nlsk;
+       struct sk_buff *skb = skb_dequeue(&audit_skb_multicast_queue);
+
+       if (!netlink_has_listeners(sock, AUDIT_NLGRP_READLOG)) {
+               while (skb) {
+                       consume_skb(skb);
+                       skb = skb_dequeue(&audit_skb_multicast_queue);
+               }
+               return;
+       }
+
+       while (skb) {
+               nlmsg_multicast(sock, skb, 0, AUDIT_NLGRP_READLOG, GFP_KERNEL);
+               skb = skb_dequeue(&audit_skb_multicast_queue);
+       }
+}
+
 static int kauditd_thread(void *dummy)
 {
        set_freezable();
@@ -517,6 +539,8 @@ static int kauditd_thread(void *dummy)
 
                flush_hold_queue();
 
+               flush_multicast_queue();
+
                skb = skb_dequeue(&audit_skb_queue);
 
                if (skb) {
@@ -530,7 +554,8 @@ static int kauditd_thread(void *dummy)
                        continue;
                }
 
-               wait_event_freezable(kauditd_wait, 
skb_queue_len(&audit_skb_queue));
+               wait_event_freezable(kauditd_wait, 
skb_queue_len(&audit_skb_queue)
+                                                  || 
skb_queue_len(&audit_skb_multicast_queue));
        }
        return 0;
 }
@@ -1197,6 +1222,7 @@ static int __init audit_init(void)
        register_pernet_subsys(&audit_net_ops);
 
        skb_queue_head_init(&audit_skb_queue);
+       skb_queue_head_init(&audit_skb_multicast_queue);
        skb_queue_head_init(&audit_skb_hold_queue);
        audit_initialized = AUDIT_INITIALIZED;
        audit_enabled = audit_default;

Re: [Patch net] net: saving irq context for peernet2id()

Reply via email to