Public bug reported:

Host OS:
Ubuntu 24.04.2 LTS
Kernel 6.11.0-21-generic
CPU: AMD Ryzen 9 5900X
Software Firmware version: F2
GPU 1: AMD Radeon RX 6400 (Used by Host OS)
GPU 2: AMD Radeon RX 6800 (Used by VMs via GPU passthrough, on PCI bus 10:00.0)

$ apt-cache policy linux-firmware
linux-firmware:
  Installed: 20240318.git3b128b60-0ubuntu2.11
  Candidate: 20240318.git3b128b60-0ubuntu2.11
  Version table:
 *** 20240318.git3b128b60-0ubuntu2.11 500
        500 http://us.archive.ubuntu.com/ubuntu noble-updates/main amd64 
Packages
        500 http://security.ubuntu.com/ubuntu noble-security/main amd64 Packages
        100 /var/lib/dpkg/status
     20240318.git3b128b60-0ubuntu2 500
        500 http://us.archive.ubuntu.com/ubuntu noble/main amd64 Packages

What should have happened:

VM with GPU passthrough should start

What happend instead:

VM with GPU passthrough wouldn't start. I tried running 'lspci -nns
0000:10:00.0' but this hung the terminal. Virtual Machine Manager was
now showing it couldn't connect to the KVM daemon. I rebooted the Host
OS but running 'lspci -nns 0000:10:00.0' again hung and I still couldn't
start the VM with GPU passthrough.

Extra info:

After installing updates to the Host OS on 2025-4-10, VMs without GPU
passthrough worked fine. On 2025-4-12 I tried to start a VM with GPU
passthrough but it wouldn't start.

On 2025-4-10 one of the Host OS updates was linux-firmware:amd64
(20240318.git3b128b60-0ubuntu2.10 -> 20240318.git3b128b60-0ubuntu2.11).

I wanted to test downgrading the linux-firmware back to version 2.10 but
that is no longer available. I was able to find, from this launchpad,
the files that were in the 2.10 and 2.11 versions of linux-firmware. I
found the differences between the files for the amdgpu firmware files. I
overwrote the /lib/firmware/amdgpu files on my host OS with the files
from 2.10 and rebooted - the VM with GPU passthrough was able to start
(and the lspci command worked.)

The list of amdgpu firmware files I overwrote was:

gc_11_5_1_imu.bin.zst
gc_11_5_1_me.bin.zst
gc_11_5_1_mec.bin.zst
gc_11_5_1_mes1.bin.zst
gc_11_5_1_mes_2.bin.zst
gc_11_5_1_pfp.bin.zst
gc_11_5_1_rlc.bin.zst
isp_4_1_1.bin.zst
psp_14_0_1_ta.bin.zst
psp_14_0_1_toc.bin.zst
sdma_6_1_1.bin.zst
vcn_4_0_6_1.bin.zst
vcn_4_0_6.bin.zst
vpe_6_1_1.bin.zst

** Affects: linux-firmware (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/2107285

Title:
  KVM VM with GPU passthrough won't start

Status in linux-firmware package in Ubuntu:
  New

Bug description:
  Host OS:
  Ubuntu 24.04.2 LTS
  Kernel 6.11.0-21-generic
  CPU: AMD Ryzen 9 5900X
  Software Firmware version: F2
  GPU 1: AMD Radeon RX 6400 (Used by Host OS)
  GPU 2: AMD Radeon RX 6800 (Used by VMs via GPU passthrough, on PCI bus 
10:00.0)

  $ apt-cache policy linux-firmware
  linux-firmware:
    Installed: 20240318.git3b128b60-0ubuntu2.11
    Candidate: 20240318.git3b128b60-0ubuntu2.11
    Version table:
   *** 20240318.git3b128b60-0ubuntu2.11 500
          500 http://us.archive.ubuntu.com/ubuntu noble-updates/main amd64 
Packages
          500 http://security.ubuntu.com/ubuntu noble-security/main amd64 
Packages
          100 /var/lib/dpkg/status
       20240318.git3b128b60-0ubuntu2 500
          500 http://us.archive.ubuntu.com/ubuntu noble/main amd64 Packages

  What should have happened:

  VM with GPU passthrough should start

  What happend instead:

  VM with GPU passthrough wouldn't start. I tried running 'lspci -nns
  0000:10:00.0' but this hung the terminal. Virtual Machine Manager was
  now showing it couldn't connect to the KVM daemon. I rebooted the Host
  OS but running 'lspci -nns 0000:10:00.0' again hung and I still
  couldn't start the VM with GPU passthrough.

  Extra info:

  After installing updates to the Host OS on 2025-4-10, VMs without GPU
  passthrough worked fine. On 2025-4-12 I tried to start a VM with GPU
  passthrough but it wouldn't start.

  On 2025-4-10 one of the Host OS updates was linux-firmware:amd64
  (20240318.git3b128b60-0ubuntu2.10 ->
  20240318.git3b128b60-0ubuntu2.11).

  I wanted to test downgrading the linux-firmware back to version 2.10
  but that is no longer available. I was able to find, from this
  launchpad, the files that were in the 2.10 and 2.11 versions of linux-
  firmware. I found the differences between the files for the amdgpu
  firmware files. I overwrote the /lib/firmware/amdgpu files on my host
  OS with the files from 2.10 and rebooted - the VM with GPU passthrough
  was able to start (and the lspci command worked.)

  The list of amdgpu firmware files I overwrote was:

  gc_11_5_1_imu.bin.zst
  gc_11_5_1_me.bin.zst
  gc_11_5_1_mec.bin.zst
  gc_11_5_1_mes1.bin.zst
  gc_11_5_1_mes_2.bin.zst
  gc_11_5_1_pfp.bin.zst
  gc_11_5_1_rlc.bin.zst
  isp_4_1_1.bin.zst
  psp_14_0_1_ta.bin.zst
  psp_14_0_1_toc.bin.zst
  sdma_6_1_1.bin.zst
  vcn_4_0_6_1.bin.zst
  vcn_4_0_6.bin.zst
  vpe_6_1_1.bin.zst

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-firmware/+bug/2107285/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to