On Fri, 15 Jul 2016 09:47:46 -0700 Alexei Starovoitov <alexei.starovoi...@gmail.com> wrote: > On Fri, Jul 15, 2016 at 09:18:13AM -0700, Tom Herbert wrote: [..] > > > We don't need extra comlexity of figuring out number of rings and > > > struggling with lack of atomicity. > > > > We already have this problem with other per ring configuration. > > not really. without atomicity of the program change, the user space > daemon that controls it will struggle to adjust. Consider the case > where we're pushing new update for loadbalancer. In such case we > want to reuse the established bpf map, since we cannot atomically > move it from old to new, but we want to swap the program that uses > in one go, otherwise two different programs will be accessing > the same map. Technically it's valid, but difference in the programs > may cause issues. Lack of atomicity is not intractable problem, > it just makes user space quite a bit more complex for no reason.
I don't think you have a problem with updating the program per queue basis, as they will be updated atomically per RX queue (thus a CPU can only see one program). Today, you already have to handle that multiple CPUs running the same program, need to access the same map. You mention that, there might be a problem, if the program differs too much to share the map. But that is the same problem as today. If you need to load a program that e.g. change the map layout, then you obviously cannot allow it inherit the old map, but must feed the new program a new map (with the new layout). There is actually a performance advantage of knowing that a program is only attached to a single RX queue. As only a single CPU can process a RX ring. Thus, when e.g. accessing a map (or other lookup table) you can avoid any locking. -- Best regards, Jesper Dangaard Brouer MSc.CS, Principal Kernel Engineer at Red Hat Author of http://www.iptv-analyzer.org LinkedIn: http://www.linkedin.com/in/brouer