** Description changed:

+ [SRU Justification]
+ 
+ [Impact]
+ 
+ Products containing gfx1151 architecture with multiple microcontrollers
+ (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
+ loading or with stress applications on the CRB. This requires rebasing
+ these firmware versions to eliminate the risk.
+ 
+ [Fix]
+ 
+ * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
+ * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
+ * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
+ * d316e650c ("amdgpu: update gc 11.5.1 firmware")
+ * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
+ 
+ [Test Case]
+ 
+ This was reported by hardware vendor using proprietary GPU stress
+ software that is not freely available to generate heavy 3D rendering
+ workload.
+ 
+ [Where problems could occur]
+ 
+ Opaque GPU firmware limited to related platforms. There might be further
+ stability issues that need additional fixes from kernel, and we can only
+ find out with more deployments later.
+ 
+ [Other Info]
+ 
+ Nominate only for Noble and Oracular, because Plucky already has all of
+ them since version 20250204.git0fd450ee-0ubuntu1.
+ 
+ ========== original bug report ==========
+ 
  Products containing gfx1151 architecture with multiple microcontrollers
  (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
  loading or with stress applications on the CRB. This requires rebasing
  these firmware versions to eliminate the risk.
  
  # upstream tag 20250211
  * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
  # upstream tag 20250109
  # upstream tag 20241210
  * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
  * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
  * d316e650c ("amdgpu: update gc 11.5.1 firmware")
  # upstream tag 20241110
  # upstream tag 20240811
  * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
  # upstream tag 20240709
  
  [ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270433] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
  [ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) 
(0xa)
  [ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
  [ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
  [ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270454] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270470] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
- --- 
+ ---
  ProblemType: Bug
  ApportVersion: 2.28.1-0ubuntu3.3
  Architecture: amd64
  CRDA: N/A
  CasperMD5CheckResult: pass
  Dependencies: firmware-sof-signed 2023.12.1-1ubuntu1.4
  DistroRelease: Ubuntu 24.04
  InstallationDate: Installed on 2024-05-07 (308 days ago)
  InstallationMedia: Ubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424)
  IwConfig:
-  lo        no wireless extensions.
-  
-  enp193s0f0  no wireless extensions.
+  lo        no wireless extensions.
+ 
+  enp193s0f0  no wireless extensions.
  MachineType: AMD MAPLE
  Package: linux-firmware
  PackageArchitecture: amd64
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.11.0-1016-oem 
root=UUID=ff988e57-9bf4-46d6-94cf-1e35d0139e12 ro 
amdgpu.ip_block_mask=0xfffffcff quiet splash vt.handoff=7
  ProcVersionSignature: Ubuntu 6.11.0-1016.16-oem 6.11.11
  RelatedPackageVersions:
-  linux-restricted-modules-6.11.0-1016-oem N/A
-  linux-backports-modules-6.11.0-1016-oem  N/A
-  linux-firmware                           20240318.git3b128b60-0ubuntu2.10
+  linux-restricted-modules-6.11.0-1016-oem N/A
+  linux-backports-modules-6.11.0-1016-oem  N/A
+  linux-firmware                           20240318.git3b128b60-0ubuntu2.10
  RfKill:
-  
+ 
  Tags: noble package-from-proposed third-party-packages
  Uname: Linux 6.11.0-1016-oem x86_64
  UnreportableReason: This does not seem to be an official Ubuntu package. 
Please retry after updating the indexes of available packages, if that does not 
work then remove related third party packages and try again.
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: N/A
  _MarkForUpload: True
  dmi.bios.date: 04/25/2024 21:25:16
  dmi.bios.release: 0.0
  dmi.bios.vendor: AMD
  dmi.bios.version: RG60061C
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.name: MAPLE-STXH
  dmi.board.vendor: AMD
  dmi.board.version: RevB
  dmi.chassis.asset.tag: Chassis Asset Tag
  dmi.chassis.type: 10
  dmi.chassis.vendor: AMD
  dmi.chassis.version: 12345
  dmi.ec.firmware.release: 0.23
  dmi.modalias: 
dmi:bvnAMD:bvrRG60061C:bd04/25/2024212516:br0.0:efr0.23:svnAMD:pnMAPLE:pvrRG60061C:rvnAMD:rnMAPLE-STXH:rvrRevB:cvnAMD:ct10:cvr12345:sku12345678:
  dmi.product.family: STXH
  dmi.product.name: MAPLE
  dmi.product.sku: 12345678
  dmi.product.version: RG60061C
  dmi.sys.vendor: AMD

** Changed in: linux-firmware (Ubuntu Oracular)
       Status: New => In Progress

** Changed in: linux-firmware (Ubuntu Noble)
       Status: New => In Progress

** Changed in: linux-firmware (Ubuntu Noble)
   Importance: Undecided => High

** Changed in: linux-firmware (Ubuntu Noble)
     Assignee: (unassigned) => You-Sheng Yang (vicamo)

** Changed in: linux-firmware (Ubuntu Oracular)
     Assignee: (unassigned) => You-Sheng Yang (vicamo)

** Changed in: linux-firmware (Ubuntu Oracular)
   Importance: Undecided => High

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/2100769

Title:
  Update amdgpu FW for GC 11.5.1

Status in HWE Next:
  New
Status in linux-firmware package in Ubuntu:
  Fix Released
Status in linux-firmware source package in Noble:
  In Progress
Status in linux-firmware source package in Oracular:
  In Progress
Status in linux-firmware source package in Plucky:
  Fix Released

Bug description:
  [SRU Justification]

  [Impact]

  Products containing gfx1151 architecture with multiple
  microcontrollers (VPE, PSP, VCN, SDMA, etc.), observed a few page
  faults during heavy loading or with stress applications on the CRB.
  This requires rebasing these firmware versions to eliminate the risk.

  [Fix]

  * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
  * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
  * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
  * d316e650c ("amdgpu: update gc 11.5.1 firmware")
  * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")

  [Test Case]

  This was reported by hardware vendor using proprietary GPU stress
  software that is not freely available to generate heavy 3D rendering
  workload.

  [Where problems could occur]

  Opaque GPU firmware limited to related platforms. There might be
  further stability issues that need additional fixes from kernel, and
  we can only find out with more deployments later.

  [Other Info]

  Nominate only for Noble and Oracular, because Plucky already has all
  of them since version 20250204.git0fd450ee-0ubuntu1.

  ========== original bug report ==========

  Products containing gfx1151 architecture with multiple
  microcontrollers (VPE, PSP, VCN, SDMA, etc.), observed a few page
  faults during heavy loading or with stress applications on the CRB.
  This requires rebasing these firmware versions to eliminate the risk.

  # upstream tag 20250211
  * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
  # upstream tag 20250109
  # upstream tag 20241210
  * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
  * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
  * d316e650c ("amdgpu: update gc 11.5.1 firmware")
  # upstream tag 20241110
  # upstream tag 20240811
  * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
  # upstream tag 20240709

  [ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270433] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
  [ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) 
(0xa)
  [ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
  [ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
  [ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270454] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270470] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  ---
  ProblemType: Bug
  ApportVersion: 2.28.1-0ubuntu3.3
  Architecture: amd64
  CRDA: N/A
  CasperMD5CheckResult: pass
  Dependencies: firmware-sof-signed 2023.12.1-1ubuntu1.4
  DistroRelease: Ubuntu 24.04
  InstallationDate: Installed on 2024-05-07 (308 days ago)
  InstallationMedia: Ubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424)
  IwConfig:
   lo        no wireless extensions.

   enp193s0f0  no wireless extensions.
  MachineType: AMD MAPLE
  Package: linux-firmware
  PackageArchitecture: amd64
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.11.0-1016-oem 
root=UUID=ff988e57-9bf4-46d6-94cf-1e35d0139e12 ro 
amdgpu.ip_block_mask=0xfffffcff quiet splash vt.handoff=7
  ProcVersionSignature: Ubuntu 6.11.0-1016.16-oem 6.11.11
  RelatedPackageVersions:
   linux-restricted-modules-6.11.0-1016-oem N/A
   linux-backports-modules-6.11.0-1016-oem  N/A
   linux-firmware                           20240318.git3b128b60-0ubuntu2.10
  RfKill:

  Tags: noble package-from-proposed third-party-packages
  Uname: Linux 6.11.0-1016-oem x86_64
  UnreportableReason: This does not seem to be an official Ubuntu package. 
Please retry after updating the indexes of available packages, if that does not 
work then remove related third party packages and try again.
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: N/A
  _MarkForUpload: True
  dmi.bios.date: 04/25/2024 21:25:16
  dmi.bios.release: 0.0
  dmi.bios.vendor: AMD
  dmi.bios.version: RG60061C
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.name: MAPLE-STXH
  dmi.board.vendor: AMD
  dmi.board.version: RevB
  dmi.chassis.asset.tag: Chassis Asset Tag
  dmi.chassis.type: 10
  dmi.chassis.vendor: AMD
  dmi.chassis.version: 12345
  dmi.ec.firmware.release: 0.23
  dmi.modalias: 
dmi:bvnAMD:bvrRG60061C:bd04/25/2024212516:br0.0:efr0.23:svnAMD:pnMAPLE:pvrRG60061C:rvnAMD:rnMAPLE-STXH:rvrRevB:cvnAMD:ct10:cvr12345:sku12345678:
  dmi.product.family: STXH
  dmi.product.name: MAPLE
  dmi.product.sku: 12345678
  dmi.product.version: RG60061C
  dmi.sys.vendor: AMD

To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to