Hello,

While reading the code of net/core/sock_reuseport.c, I think I found a
possible race in reuseport_select_sock. The current code has the
following pattern:

   socks = READ_ONCE(reuse->num_socks);
   smp_rmb(); // paired with reuseport_add_sock to make sure
reuse->socks[i < num_socks] are initialized
   while (reuse->socks[i]->sk_state == TCP_ESTABLISHED) {
        if (i >= reuse->num_socks) // should be "socks" and not
"reuse->num_socks"?

The barrier seems to be here to make sure that all items of
reuse->socks are initialized before being read, but the barrier only
protects indexes < socks, not indexes < reuse->num_socks.

I have a patch ready to fix this issue, but I wanted to confirm that
the current code is indeed incorrect (if the code is correct for a
non-obvious reason, I'd be happy to add some comments to document the
behavior).


Here is the diff of the patch I was planning to submit:

diff --git a/net/core/sock_reuseport.c b/net/core/sock_reuseport.c
index bbdd3c7b6cb5..b065f0a103ed 100644
--- a/net/core/sock_reuseport.c
+++ b/net/core/sock_reuseport.c
@@ -293,7 +293,7 @@ struct sock *reuseport_select_sock(struct sock *sk,
             i = j = reciprocal_scale(hash, socks);
             while (reuse->socks[i]->sk_state == TCP_ESTABLISHED) {
                 i++;
-                if (i >= reuse->num_socks)
+                if (i >= socks)
                     i = 0;
                 if (i == j)
                     goto out;


Thanks,
Baptiste.

Reply via email to