From: Ashutosh Kaushik <k.ashut...@samsung.com>

The patch fixes the issues with check of global flag "rps_needed" in RX Path 
(which process packets in TCP/IP stack like netif_rx and netif_receive_skb 
functions)
These functions have flag CONFIG RPS which is enabled default in kernel and to 
enter in RPS mode, it depends on variable rps_needed.
This variable is updated whenever value in 
/sys/class/net/<device>/queues/rx-0/rps_cpus is being changed.
There are 2 scenarios where it is executing extra piece of code even when 
results would be same every time:-

1) Suppose in system more than one networking devices are connected i.e. wired 
(eth0) and wireless (wlan0). 
   If I enable RPS using above method for wlan0, then it sets atomic variable 
in rps_needed flag which is global.
   Now,whenever traffic uses wired network, then it will execute that extra 
piece of code in RX path because of only dependency on global rps_needed 
variable as below:
  #ifdef CONFIG_RPS
        if (static_key_false(&rps_needed)) {
                struct rps_dev_flow voidflow, *rflow = &voidflow;
                int cpu = get_rps_cpu(skb->dev, skb, &rflow);

                if (cpu >= 0) {
                        ret = enqueue_to_backlog(skb, cpu, &rflow->last_qtail);
                        rcu_read_unlock();
                        return ret;
                }
        }
  #endif
 
        while every time, value returned from get_rps_cpu will be < 0 as 
rps_map value is not set for this network device using 
/sys/class/net/device/queues/rx-0/rps_cpus.
        And it will every time execute get_rps_cpu function with fail case in 
IF condition which will be an extra overhead for packet processing.
        
2) Another scenario is as: Suppose that we have enable RPS for wireless device 
say wlan0 using above specified method which will set value of rps_needed. 
   After doing test with set value of rps_cpus in sysfs, we do rmmod driver of 
wireless device.
   Next time again when we do insmod without rebooting system, it always hit 
below code with fail case:
   #ifdef CONFIG_RPS
        if (static_key_false(&rps_needed)) {
                struct rps_dev_flow voidflow, *rflow = &voidflow;
                int cpu;

                preempt_disable();
                rcu_read_lock();

                cpu = get_rps_cpu(skb->dev, skb, &rflow);
                if (cpu < 0)
                        cpu = smp_processor_id();

                ret = enqueue_to_backlog(skb, cpu, &rflow->last_qtail);

                rcu_read_unlock();
                preempt_enable();
        } else
   #endif
   The reason behind this overhead of hitting this code with false case is same 
because before doing rmmod, we enabled RPS which set rps_needed flag.
   Next time when we do insmod, it will just create entry for network device in 
sysfs with default value which is 0 for rps_cpus. It implies that RPS is 
disable for that device.
   But due to unchanged value of rps_needed variable, it goes into IF condition 
every time (which is failed in get_rps_cpu function) even rps_cpus is 0.
   Because if we do not enable RPS for that network device, rps_map is not set.
   
The patch adds a check to these two RX functions which will check RPS 
availability locally for device specific.
 
Signed-off-by: Ashutosh Kaushik <k.ashut...@samsung.com>
---
 net/core/dev.c | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/net/core/dev.c b/net/core/dev.c
index 5df6cbc..1aa4402 100644
--- a/net/core/dev.c
+++ b/net/core/dev.c
@@ -3531,12 +3531,14 @@ drop:
 static int netif_rx_internal(struct sk_buff *skb)
 {
        int ret;
+       struct netdev_rx_queue *dev_rxqueue = skb->dev->_rx;
 
        net_timestamp_check(netdev_tstamp_prequeue, skb);
 
        trace_netif_rx(skb);
 #ifdef CONFIG_RPS
-       if (static_key_false(&rps_needed)) {
+       if (static_key_false(&rps_needed) &&
+           dev_rxqueue->rps_map && dev_rxqueue->rps_map->len) {
                struct rps_dev_flow voidflow, *rflow = &voidflow;
                int cpu;
 
@@ -3986,6 +3988,7 @@ static int __netif_receive_skb(struct sk_buff *skb)
 static int netif_receive_skb_internal(struct sk_buff *skb)
 {
        int ret;
+       struct netdev_rx_queue *dev_rxqueue = skb->dev->_rx;
 
        net_timestamp_check(netdev_tstamp_prequeue, skb);
 
@@ -3995,7 +3998,8 @@ static int netif_receive_skb_internal(struct sk_buff *skb)
        rcu_read_lock();
 
 #ifdef CONFIG_RPS
-       if (static_key_false(&rps_needed)) {
+       if (static_key_false(&rps_needed) &&
+           dev_rxqueue->rps_map && dev_rxqueue->rps_map->len) {
                struct rps_dev_flow voidflow, *rflow = &voidflow;
                int cpu = get_rps_cpu(skb->dev, skb, &rflow);
 
-- 
1.9.1

--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to