I tested netplan.io 0.107.1-3ubuntu0.22.04.2 from jammy-proposed, and
everything looks good. The intermittent failures reported in comment #12
are resolved.

First of all, the eswitch/switchdev functionality is not available on Jammy's
GA 5.15 kernel, so I upgraded to the HWE kernel and installed Netplan from
proposed:

ubuntu@akis:~$ sudo devlink dev eswitch show pci/0000:86:00.0
kernel answers: Operation not supported
ubuntu@akis:~$ sudo apt-get install --install-recommends linux-generic-hwe-22.04
ubuntu@akis:~$ uname -a
Linux akis 6.8.0-51-generic #52~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Mon Dec  9 15:00:52 UTC 2 x86_64 x86_64 x86_64 GNU/Linux
ubuntu@akis:~$ sudo apt install -t jammy-proposed netplan.io
ubuntu@akis:~$ apt list *netplan*
Listing... Done
libnetplan-dev/jammy-proposed 0.107.1-3ubuntu0.22.04.2 amd64
libnetplan0/jammy-proposed,now 0.107.1-3ubuntu0.22.04.2 amd64 [installed,automatic]
netplan-generator/jammy-proposed,now 0.107.1-3ubuntu0.22.04.2 amd64 [installed,automatic]
netplan.io/jammy-proposed,now 0.107.1-3ubuntu0.22.04.2 amd64 [installed,automatic]
python3-netplan/jammy-proposed,now 0.107.1-3ubuntu0.22.04.2 amd64 [installed,automatic]
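
For anyone repeating this verification, the eswitch check can be wrapped in a
small helper that is run before and after the kernel upgrade. This is only a
sketch: the function name eswitch_supported is made up for illustration, and it
assumes that devlink exits non-zero when the kernel answers "Operation not
supported".

```shell
# Hypothetical helper (not part of netplan or iproute2): report whether the
# eswitch mode of a PCI device can be queried through devlink.
# Assumption: devlink exits non-zero when the kernel rejects the request.
eswitch_supported() {
    if devlink dev eswitch show "pci/$1" >/dev/null 2>&1; then
        echo "pci/$1: eswitch supported"
    else
        echo "pci/$1: eswitch not supported (or devlink failed)"
        return 1
    fi
}

# Example call (needs root), using the PCI address from this report:
#   eswitch_supported 0000:86:00.0
```

A failure here does not distinguish between a kernel without switchdev support
and a missing devlink binary, so the raw devlink output is still worth a look.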

Next, I identified the Mellanox ConnectX-5 NIC (enp134...0/1) and confirmed
that LAG is disabled:
ubuntu@akis:~$ sudo lshw -c network -businfo
Bus info          Device          Class          Description
============================================================
pci@0000:06:00.0  enp6s0          network        I210 Gigabit Network Connection
pci@0000:35:00.0  enp53s0np0      network        MT27800 Family [ConnectX-5]
pci@0000:3a:00.0  enp58s0np0      network        MT27800 Family [ConnectX-5]
pci@0000:58:00.0  enp88s0np0      network        MT27800 Family [ConnectX-5]
pci@0000:5d:00.0  enp93s0np0      network        MT27800 Family [ConnectX-5]
pci@0000:86:00.0  enp134s0f0np0   network        MT27800 Family [ConnectX-5]
pci@0000:86:00.1  enp134s0f1np1   network        MT27800 Family [ConnectX-5]
pci@0000:b8:00.0  enp184s0np0     network        MT27800 Family [ConnectX-5]
pci@0000:bd:00.0  enp189s0np0     network        MT27800 Family [ConnectX-5]
pci@0000:e1:00.0  enp225s0np0     network        MT27800 Family [ConnectX-5]
pci@0000:e6:00.0  enp230s0np0     network        MT27800 Family [ConnectX-5]
ubuntu@akis:~$ sudo cat /sys/kernel/debug/mlx5/0000:86:00.0/lag/state
disabled
ubuntu@akis:~$ sudo cat /sys/kernel/debug/mlx5/0000:86:00.1/lag/state
disabled
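
The two reads above can be folded into one loop, which is handy when comparing
the state before and after the reboot. This is a sketch under assumptions: the
function name lag_states and the MLX5_DEBUGFS override are hypothetical and
exist only so the path can be redirected; the default path is the debugfs
location used in the commands above.

```shell
# Hypothetical helper: print the mlx5 LAG state for each PCI function given.
# MLX5_DEBUGFS is an assumed override for the debugfs base directory.
MLX5_DEBUGFS="${MLX5_DEBUGFS:-/sys/kernel/debug/mlx5}"

lag_states() {
    for dev in "$@"; do
        state_file="$MLX5_DEBUGFS/$dev/lag/state"
        if [ -r "$state_file" ]; then
            # Print "<pci address>: <state>", e.g. disabled or active
            printf '%s: %s\n' "$dev" "$(cat "$state_file")"
        else
            printf '%s: unavailable\n' "$dev"
        fi
    done
}

# Example call; debugfs is normally readable by root only, so run it as root:
#   lag_states 0000:86:00.0 0000:86:00.1
```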


I changed the Netplan configuration according to the test plan above, and 
rebooted the system:
ubuntu@akis:~$ sudo netplan get

** (process:3196): WARNING **: 14:22:14.283: `gateway4` has been deprecated, use default routes instead.
See the 'Default routes' section of the documentation for more details.
network:
  version: 2
  ethernets:
    enp134s0f0np0:
      optional: true
      virtual-function-count: 8
      embedded-switch-mode: "switchdev"
      delay-virtual-functions-rebind: true
    enp134s0f1np1:
      optional: true
      virtual-function-count: 8
      embedded-switch-mode: "switchdev"
      delay-virtual-functions-rebind: true
[...]
  bonds:
    bond0:
      interfaces:
      - enp134s0f0np0
      - enp134s0f1np1
      parameters:
        mode: "active-backup"


After the reboot, link aggregation (LAG) is active and bond0 is up:
ubuntu@akis:~$ sudo cat /sys/kernel/debug/mlx5/0000:86:00.1/lag/state
active
ubuntu@akis:~$ sudo cat /sys/kernel/debug/mlx5/0000:86:00.0/lag/state
active
ubuntu@akis:~$ netplan status bond0
     Online state: online
    DNS Addresses: 127.0.0.53 (stub)
       DNS Search: maas

● 13: bond0 bond UP (networkd: bond0)
      MAC Address: ce:03:e9:7f:f9:9d
        Addresses: fe80::cc03:e9ff:fe7f:f99d/64 (link)
           Routes: fe80::/64 metric 256

12 inactive interfaces hidden. Use "--all" to show all.
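
As an optional cross-check that is not part of the test plan, the bonding
driver exposes the bond mode and member list in sysfs. The sketch below reads
both; the function name bond_summary and the SYS_NET override are hypothetical
and only there to make the path adjustable.

```shell
# Hypothetical helper: show mode and member interfaces of a bond via sysfs.
# SYS_NET is an assumed override for the sysfs network class directory.
SYS_NET="${SYS_NET:-/sys/class/net}"

bond_summary() {
    bond="$1"
    dir="$SYS_NET/$bond/bonding"
    if [ ! -d "$dir" ]; then
        echo "$bond: not a bond (no $dir)" >&2
        return 1
    fi
    # "mode" holds the mode name and its number; "slaves" lists the members
    printf 'mode: %s\n' "$(cat "$dir/mode")"
    printf 'members: %s\n' "$(cat "$dir/slaves")"
}

# Example call with the bond from the configuration above:
#   bond_summary bond0
```

With the configuration used here, the members line should list enp134s0f0np0
and enp134s0f1np1, and the mode line should name active-backup.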

** Tags removed: verification-needed verification-needed-jammy
** Tags added: verification-done verification-done-jammy

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1988018

Title:
  [SRU][mlx5] Intermittent VF-LAG activation failure


