From: Jiri Pirko <j...@mellanox.com>

There a is need for some userspace API that would allow to expose things
that are not directly related to any device class like net_device of
ib_device, but rather chip-wide/switch-ASIC-wide stuff.

Use cases:
1) get/set of port type (Ethernet/InfiniBand)
2) setting up port splitters - split port into multiple ones and squash again,
   enables usage of splitter cable
3) setting up shared buffers - shared among multiple ports within
   one chip (work in progress)
4) configuration of switch wide properties - resources division etc - This will
   allow to pass configuration that is unacceptable to be passed as
   a module option.

First patch of this set introduces a new generic Netlink based interface,
called "devlink". It is similar to nl80211 model and it is heavily
influenced by it, including the API definition. The devlink introduction patch
implements use cases 1) and 2). Other 2 are in development atm and will
be addressed by follow-ups.

It is very convenient for drivers to use devlink, as you can see in other
patches in this set.

Counterpart for devlink is userspace tool for now called "dl". Command line
interface and outputs are derived from "ip" tool so it should be easy
for users to get used to it.

It is available here as a standalone tool for now:
https://github.com/jpirko/devlink
After this is merge in kernel, I will include the "dl" or "devlink" tool
into iproute2 toolset.

Port type setting example:
        myhost:~$ dl help
        Usage: dl [ OPTIONS ] OBJECT { COMMAND | help }
        where  OBJECT := { dev | port | monitor }
               OPTIONS := { -v/--verbose }
        
        myhost:~$ dl dev help
        Usage: dl dev show [DEV]
        
        myhost:~$ dl dev show
        pci/0000:01:00.0
        
        myhost:~$ dl port help
        Usage: dl port show [DEV/PORT_INDEX]
        Usage: dl port set DEV/PORT_INDEX [ type { eth | ib | auto} ]
        Usage: dl port split DEV/PORT_INDEX count
        Usage: dl port unsplit DEV/PORT_INDEX
        
        myhost:~$ dl port show
        pci/0000:01:00.0/1: type ib ibdev mlx4_0
        pci/0000:01:00.0/2: type ib ibdev mlx4_0
        
        myhost:~$ sudo dl port set pci/0000:01:00.0/1 type eth
        
        myhost:~$ dl port show
        pci/0000:01:00.0/1: type eth netdev ens4
        pci/0000:01:00.0/2: type ib ibdev mlx4_0
        
        myhost:~$ sudo dl port set ens4 type auto
        
        myhost:~$ dl port show
        pci/0000:01:00.0/1: type eth(auto) netdev ens4
        pci/0000:01:00.0/2: type ib ibdev mlx4_0

Port splitting example:
        myswitch:~$ sudo modprobe mlxsw_pci
        myswitch:~$ dl port
        pci/0000:03:00.0/1: type eth netdev eth0
        pci/0000:03:00.0/3: type eth netdev eth1
        pci/0000:03:00.0/5: type eth netdev eth2
        ...
        pci/0000:03:00.0/63: type eth netdev eth31
        
        myswitch:~$ sudo dl port split pci/0000:03:00.0/1 2   (or "sudo dl port 
split eth0 2")
        
        myswitch:~$ dl port
        pci/0000:03:00.0/3: type eth netdev eth1
        pci/0000:03:00.0/5: type eth netdev eth2
        ...
        pci/0000:03:00.0/63: type eth netdev eth31
        pci/0000:03:00.0/1: type eth netdev eth0 split_group 16
        pci/0000:03:00.0/2: type eth netdev eth32 split_group 16
        
        myswitch:~$ sudo dl port unsplit pci/0000:03:00.0/1
        
        myswitch:~$ dl port
        pci/0000:03:00.0/3: type eth netdev eth1
        pci/0000:03:00.0/5: type eth netdev eth2
        pci/0000:03:00.0/63: type eth netdev eth31
        pci/0000:03:00.0/1: type eth netdev eth0

---
v2->v3:
patch 1/9
 -removed generated devlink index and name, use bus name and dev name as
  a handle for all userspace originated commands. Along with that,
  remove sysfs stub. Requested by Hannes Sowa.
patch 2/9
 -add dev param to devlink_register (api change)
patch 4/9
 -add dev param to devlink_register (api change)
patch 9/9
 -set port's speed according to width fix by Ido
v1->v2:
patch 1/9
 -removed no longer used "devlink_dev" helper
 -fix couple of typos and misspells
patch 4/9:
 -removed SET_NETDEV_DEV set to devlink dev

Ido Schimmel (4):
  mlxsw: spectrum: Unmap local port from module during teardown
  mlxsw: spectrum: Store local port to module mapping during init
  mlxsw: spectrum: Mark unused ports using NULL
  mlxsw: spectrum: Introduce port splitting

Jiri Pirko (5):
  Introduce devlink infrastructure
  mlx4: Implement devlink interface
  mlx4: Implement port type setting via devlink interface
  mlxsw: Implement devlink interface
  mlxsw: core: Add devlink port splitter callbacks

 MAINTAINERS                                    |   8 +
 drivers/infiniband/hw/mlx4/main.c              |   7 +
 drivers/net/ethernet/mellanox/mlx4/en_netdev.c |   8 +-
 drivers/net/ethernet/mellanox/mlx4/intf.c      |   9 +
 drivers/net/ethernet/mellanox/mlx4/main.c      | 128 +++--
 drivers/net/ethernet/mellanox/mlx4/mlx4.h      |   2 +
 drivers/net/ethernet/mellanox/mlxsw/core.c     |  55 +-
 drivers/net/ethernet/mellanox/mlxsw/core.h     |   2 +
 drivers/net/ethernet/mellanox/mlxsw/port.h     |   2 +
 drivers/net/ethernet/mellanox/mlxsw/spectrum.c | 272 ++++++++-
 drivers/net/ethernet/mellanox/mlxsw/spectrum.h |  10 +-
 drivers/net/ethernet/mellanox/mlxsw/switchx2.c |  20 +
 include/linux/mlx4/driver.h                    |   3 +
 include/net/devlink.h                          | 140 +++++
 include/uapi/linux/devlink.h                   |  72 +++
 net/Kconfig                                    |   7 +
 net/core/Makefile                              |   1 +
 net/core/devlink.c                             | 738 +++++++++++++++++++++++++
 18 files changed, 1425 insertions(+), 59 deletions(-)
 create mode 100644 include/net/devlink.h
 create mode 100644 include/uapi/linux/devlink.h
 create mode 100644 net/core/devlink.c

-- 
2.5.0

Reply via email to