Installing Mellanox OFED in Xenial with Kernel 4.4:

# lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 16.04.1 LTS
Release:        16.04
Codename:       xenial

# uname -a
Linux heatmor 4.4.0-36-generic #55-Ubuntu SMP Thu Aug 11 18:01:55 UTC 2016 
x86_64 x86_64 x86_64 GNU/Linux

And, after configuring SR-IOV for ConnectX-4 (after configured with
mlxconfig):

# lspci | grep -i mellanox
08:00.0 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4]
08:00.1 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4]
08:00.2 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4 
Virtual Function]
08:00.3 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4 
Virtual Function]
08:00.4 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4 
Virtual Function]
08:00.5 Infiniband controller: Mellanox Technologies MT27700 Family [ConnectX-4 
Virtual Function]

echo 4 > /sys/class/infiniband/mlx5_0/device/sriov_numvfs

echo Follow > /sys/class/infiniband/mlx5_0/device/sriov/0/policy

echo e4:1d:2d:03:00:af:4f:06 > /sys/class/infiniband/mlx5_0/device/sriov/0/node
echo e4:1d:2d:03:00:af:5f:06 > /sys/class/infiniband/mlx5_0/device/sriov/0/port

echo e4:1d:2d:03:00:af:4f:07 > /sys/class/infiniband/mlx5_0/device/sriov/1/node
echo e4:1d:2d:03:00:af:5f:07 > /sys/class/infiniband/mlx5_0/device/sriov/1/port

echo e4:1d:2d:03:00:af:4f:08 > /sys/class/infiniband/mlx5_0/device/sriov/2/node
echo e4:1d:2d:03:00:af:5f:08 > /sys/class/infiniband/mlx5_0/device/sriov/2/port

echo e4:1d:2d:03:00:af:4f:09 > /sys/class/infiniband/mlx5_0/device/sriov/3/node
echo e4:1d:2d:03:00:af:5f:09 > /sys/class/infiniband/mlx5_0/device/sriov/3/port

echo 0000:08:00.2 > /sys/bus/pci/drivers/mlx5_core/unbind
echo 0000:08:00.2 > /sys/bus/pci/drivers/mlx5_core/bind

echo 0000:08:00.3 > /sys/bus/pci/drivers/mlx5_core/unbind
echo 0000:08:00.3 > /sys/bus/pci/drivers/mlx5_core/bind

echo 0000:08:00.4 > /sys/bus/pci/drivers/mlx5_core/unbind
echo 0000:08:00.4 > /sys/bus/pci/drivers/mlx5_core/bind

echo 0000:08:00.5 > /sys/bus/pci/drivers/mlx5_core/unbind
echo 0000:08:00.5 > /sys/bus/pci/drivers/mlx5_core/bind

And attaching this XML to a guest:

<interface type='hostdev' managed='yes'>
    <source>
      <address type='pci' domain='0' bus='8' slot='0' function='2'/>
        </source>
</interface>

root@heatmor:~# virsh attach-device ibdhcprelay ./new-device.xml
--config

I can't start the guest in question:

# virsh start ibdhcprelay
error: Failed to start domain ibdhcprelay
error: internal error: missing IFLA_VF_INFO in netlink response

Looks like there is an incompatibility between Mellanox OFED DKMS
packages and Ubuntu kernel (specifically about netlink support).

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1496942

Title:
  Infiniband (mellanox) SR-IOV and libvirt + libnl problems

Status in linux package in Ubuntu:
  Triaged

Bug description:
  When trying to start an IB SR-IOV guest by using the following XML:

      <interface type='hostdev' managed='yes'>
        <mac address='52:54:00:70:ba:16'/>
        <source>
          <address type='pci' domain='0x0000' bus='0x08' slot='0x00' 
function='0x1'/>
        </source>
        <address type='pci' domain='0x0000' bus='0x00' slot='0x08' 
function='0x0'/>
      </interface>

  following the Mellanox SR-IOV guide, we are able to start guests using
  kernel 3.16 (Utopic).

  We are NOT able to start guests using 3.13 OR 3.19. The following
  error occurs:

  2015-09-17 02:25:07.208+0000: 52157: info : libvirt version: 1.2.12, package: 
1.2.12-0ubuntu14.1~cloud0
  2015-09-17 02:25:07.208+0000: 52157: error : virSecurityDriverLookup:80 : 
unsupported configuration: Security driver apparmor not enabled
  2015-09-17 02:25:42.308+0000: 52281: info : libvirt version: 1.2.12, package: 
1.2.12-0ubuntu14.1~cloud0
  2015-09-17 02:25:42.308+0000: 52281: error : virSecurityDriverLookup:80 : 
unsupported configuration: Security driver apparmor not enabled
  2015-09-17 02:25:48.996+0000: 52274: error : virNetDevParseVfConfig:1905 : 
internal error: missing IFLA_VF_INFO in netlink response
  2015-09-17 02:25:49.006+0000: 52274: error : virFileReadAll:1347 : Failed to 
open file '/var/run/libvirt/hostdevmgr/ib0_vf0': No such file or directory
  2015-09-17 02:25:49.006+0000: 52274: error : virFileReadAll:1347 : Failed to 
open file '/var/run/libvirt/qemu/ib0_vf0': No such file or directory

  So probably there is some regression in between 3.16 and 3.19 for the
  IFLA_VF_INFO feature from netlink AND this has to be backported to
  kernel 3.13 for Trusty to have IB SR-IOV working.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1496942/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to