The environment has customer data, so collecting with apport-collect is
unfortunately not possible.

Kernel version on the affected machines: 5.4.0-124-generic

Please see `lspci` and `lshw` output below.

```
sudo lshw -class network | egrep "(ens1f0np0|ens4f0np0)" -A 8 -B 6
  *-network:0
       description: Ethernet interface
       product: BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller
       vendor: Broadcom Inc. and subsidiaries
       physical id: 0
       bus info: pci@0000:12:00.0
       logical name: ens1f0np0
       version: 01
       serial: f4:03:43:ee:89:90
       capacity: 25Gbit/s
       width: 64 bits
       clock: 33MHz
       capabilities: pm vpd msi msix pciexpress bus_master cap_list rom 
ethernet physical fibre 1000bt-fd 25000bt-fd autonegotiation
       configuration: autonegotiation=on broadcast=yes driver=bnxt_en 
driverversion=1.10.0 duplex=full firmware=218.0.152.0/pkg 218.0.166000 
latency=0 link=yes multicast=yes port=fibre slave=yes
       resources: irq:26 memory:de610000-de61ffff memory:de500000-de5fffff 
memory:de622000-de623fff memory:de800000-de83ffff
--
  *-network:0
       description: Ethernet interface
       product: BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller
       vendor: Broadcom Inc. and subsidiaries
       physical id: 0
       bus info: pci@0000:af:00.0
       logical name: ens4f0np0
       version: 01
       serial: f4:03:43:ee:89:90
       capacity: 25Gbit/s
       width: 64 bits
       clock: 33MHz
       capabilities: pm vpd msi msix pciexpress bus_master cap_list rom 
ethernet physical fibre 1000bt-fd 25000bt-fd autonegotiation
       configuration: autonegotiation=on broadcast=yes driver=bnxt_en 
driverversion=1.10.0 duplex=full firmware=218.0.152.0/pkg 218.0.166000 
latency=0 link=yes multicast=yes port=fibre slave=yes
       resources: irq:42 memory:f3a10000-f3a1ffff memory:f3900000-f39fffff 
memory:f3a22000-f3a23fff memory:f3c00000-f3c3ffff


sudo lspci | grep 57414
12:00.0 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)
12:00.1 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)
af:00.0 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)
af:00.1 Ethernet controller: Broadcom Inc. and subsidiaries BCM57414 
NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller (rev 01)
```


** Changed in: linux (Ubuntu)
       Status: Incomplete => Confirmed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1983468

Title:
  tx checksumming offload results in TCP/UDP packet drops (was Octavia
  amphora loadbalancer gets stuck at PENDING_CREATE status)

Status in OpenStack Octavia Charm:
  Invalid
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  In a new focal-yoga deployment, the creation of a loadbalancer gets
  stuck at PENDING_CREATE status.

  Checking the amphora we could see that it stays at BOOTING status:

  $ openstack loadbalancer amphora show ef48089d-ba40-46db-92e8-e369f764f017 
--format yaml
  id: ef48089d-ba40-46db-92e8-e369f764f017
  loadbalancer_id: dcd17d9e-6a27-43c5-9c3f-eb2b2655556d
  compute_id: 5efe11a8-93d8-4278-94c2-4efc8b015009
  lb_network_ip: fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa
  vrrp_ip: null
  ha_ip: null
  vrrp_port_id: null
  ha_port_id: null
  cert_expiration: '2022-09-01T20:27:05'
  cert_busy: false
  role: null
  status: BOOTING
  vrrp_interface: null
  vrrp_id: null
  vrrp_priority: null
  cached_zone: nova
  created_at: '2022-08-02T20:27:05'
  updated_at: '2022-08-02T20:30:13'
  image_id: 6c6cd911-197f-45d3-a6d5-4ff1789d4ee7
  compute_flavor: 638fa4c5-e81b-438f-a12b-1ef7faf81c3e

  
  /var/log/octavia/octavia-worker.log shows several warnings about connection 
failure to the amphora.

  2022-08-02 20:30:22.589 149659 WARNING
  octavia.amphorae.drivers.haproxy.rest_api_driver [-] Could not connect
  to instance. Retrying.: requests.exceptions.ConnectTimeout:
  HTTPSConnectionPool(host='fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa',
  port=9443): Max retries exceeded with url: // (Caused by
  ConnectTimeoutError(<urllib3.connection.VerifiedHTTPSConnection object
  at 0x7f87a655fc70>, 'Connection to
  fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa timed out. (connect
  timeout=10.0)'))

  
  It is possible to ping the amphora from an octavia unit:

  # ping -M do -s 1452 fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa
  PING 
fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa(fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa)
 1452 data bytes
  1460 bytes from fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa: icmp_seq=1 ttl=64 
time=2.45 ms
  1460 bytes from fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa: icmp_seq=2 ttl=64 
time=1.01 ms
  1460 bytes from fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa: icmp_seq=3 ttl=64 
time=0.532 ms
  1460 bytes from fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa: icmp_seq=4 ttl=64 
time=0.417 ms

  Also, the port tcp/22 is reacheable from the octavia unit:

  # telnet fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa 22
  Trying fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa...
  Connected to fc00:b81a:629a:59a6:f816:3eff:fe0a:68fa.
  Escape character is '^]'.
  SSH-2.0-OpenSSH_8.2p1 Ubuntu-4ubuntu0.5

  After running the config-changed hook as described at the bug
  https://bugs.launchpad.net/charm-octavia/+bug/1961088 , the creation
  of a loadbalancer ends with a ERROR provisioning_status.

To manage notifications about this bug go to:
https://bugs.launchpad.net/charm-octavia/+bug/1983468/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to