** Description changed:

+ [SRU Justification]
+ 
+ [Impact]
+ 
+ Products containing gfx1151 architecture with multiple microcontrollers
+ (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
+ loading or with stress applications on the CRB. This requires rebasing
+ these firmware versions to eliminate the risk.
+ 
+ [Fix]
+ 
+ * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
+ * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
+ * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
+ * d316e650c ("amdgpu: update gc 11.5.1 firmware")
+ * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
+ 
+ [Test Case]
+ 
+ This was reported by hardware vendor using proprietary GPU stress
+ software that is not freely available to generate heavy 3D rendering
+ workload.
+ 
+ [Where problems could occur]
+ 
+ Opaque GPU firmware limited to related platforms. There might be further
+ stability issues that need additional fixes from kernel, and we can only
+ find out with more deployments later.
+ 
+ [Other Info]
+ 
+ Nominate only for Noble and Oracular, because Plucky already has all of
+ them since version 20250204.git0fd450ee-0ubuntu1.
+ 
+ ========== original bug report ==========
+ 
  Products containing gfx1151 architecture with multiple microcontrollers
  (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
  loading or with stress applications on the CRB. This requires rebasing
  these firmware versions to eliminate the risk.
  
  # upstream tag 20250211
  * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
  # upstream tag 20250109
  # upstream tag 20241210
  * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
  * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
  * d316e650c ("amdgpu: update gc 11.5.1 firmware")
  # upstream tag 20241110
  # upstream tag 20240811
  * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
  # upstream tag 20240709
  
  [ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270433] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
  [ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) 
(0xa)
  [ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
  [ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
  [ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270454] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
  [ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 
ring:24 vmid:9 pasid:32771)
  [ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 
3362 thread redshiftCmdLine pid 3362)
  [ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 
0x0000000000000000 from client 10
  [ 217.270470] amdgpu 0000:c5:00.0: amdgpu: 
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
  [ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
  [ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
  [ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
  [ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
  [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
- --- 
+ ---
  ProblemType: Bug
  ApportVersion: 2.28.1-0ubuntu3.3
  Architecture: amd64
  CRDA: N/A
  CasperMD5CheckResult: pass
  Dependencies: firmware-sof-signed 2023.12.1-1ubuntu1.4
  DistroRelease: Ubuntu 24.04
  InstallationDate: Installed on 2024-05-07 (308 days ago)
  InstallationMedia: Ubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424)
  IwConfig:
-  lo        no wireless extensions.
-  
-  enp193s0f0  no wireless extensions.
+  lo        no wireless extensions.
+ 
+  enp193s0f0  no wireless extensions.
  MachineType: AMD MAPLE
  Package: linux-firmware
  PackageArchitecture: amd64
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.11.0-1016-oem 
root=UUID=ff988e57-9bf4-46d6-94cf-1e35d0139e12 ro 
amdgpu.ip_block_mask=0xfffffcff quiet splash vt.handoff=7
  ProcVersionSignature: Ubuntu 6.11.0-1016.16-oem 6.11.11
  RelatedPackageVersions:
-  linux-restricted-modules-6.11.0-1016-oem N/A
-  linux-backports-modules-6.11.0-1016-oem  N/A
-  linux-firmware                           20240318.git3b128b60-0ubuntu2.10
+  linux-restricted-modules-6.11.0-1016-oem N/A
+  linux-backports-modules-6.11.0-1016-oem  N/A
+  linux-firmware                           20240318.git3b128b60-0ubuntu2.10
  RfKill:
-  
+ 
  Tags: noble package-from-proposed third-party-packages
  Uname: Linux 6.11.0-1016-oem x86_64
  UnreportableReason: This does not seem to be an official Ubuntu package. 
Please retry after updating the indexes of available packages, if that does not 
work then remove related third party packages and try again.
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: N/A
  _MarkForUpload: True
  dmi.bios.date: 04/25/2024 21:25:16
  dmi.bios.release: 0.0
  dmi.bios.vendor: AMD
  dmi.bios.version: RG60061C
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.name: MAPLE-STXH
  dmi.board.vendor: AMD
  dmi.board.version: RevB
  dmi.chassis.asset.tag: Chassis Asset Tag
  dmi.chassis.type: 10
  dmi.chassis.vendor: AMD
  dmi.chassis.version: 12345
  dmi.ec.firmware.release: 0.23
  dmi.modalias: 
dmi:bvnAMD:bvrRG60061C:bd04/25/2024212516:br0.0:efr0.23:svnAMD:pnMAPLE:pvrRG60061C:rvnAMD:rnMAPLE-STXH:rvrRevB:cvnAMD:ct10:cvr12345:sku12345678:
  dmi.product.family: STXH
  dmi.product.name: MAPLE
  dmi.product.sku: 12345678
  dmi.product.version: RG60061C
  dmi.sys.vendor: AMD

** Changed in: linux-firmware (Ubuntu Oracular)
       Status: New => In Progress

** Changed in: linux-firmware (Ubuntu Noble)
       Status: New => In Progress

** Changed in: linux-firmware (Ubuntu Noble)
   Importance: Undecided => High

** Changed in: linux-firmware (Ubuntu Noble)
     Assignee: (unassigned) => You-Sheng Yang (vicamo)

** Changed in: linux-firmware (Ubuntu Oracular)
     Assignee: (unassigned) => You-Sheng Yang (vicamo)

** Changed in: linux-firmware (Ubuntu Oracular)
   Importance: Undecided => High

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2100769

Title:
  Update amdgpu FW for GC 11.5.1

To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to