** Description changed: + [SRU Justification] + + [Impact] + + Products containing gfx1151 architecture with multiple microcontrollers + (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy + loading or with stress applications on the CRB. This requires rebasing + these firmware versions to eliminate the risk. + + [Fix] + + * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware") + * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware") + * 4a172771d ("amdgpu: update psp 14.0.1 firmware") + * d316e650c ("amdgpu: update gc 11.5.1 firmware") + * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware") + + [Test Case] + + This was reported by hardware vendor using proprietary GPU stress + software that is not freely available to generate heavy 3D rendering + workload. + + [Where problems could occur] + + Opaque GPU firmware limited to related platforms. There might be further + stability issues that need additional fixes from kernel, and we can only + find out with more deployments later. + + [Other Info] + + Nominate only for Noble and Oracular, because Plucky already has all of + them since version 20250204.git0fd450ee-0ubuntu1. + + ========== original bug report ========== + Products containing gfx1151 architecture with multiple microcontrollers (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy loading or with stress applications on the CRB. This requires rebasing these firmware versions to eliminate the risk. # upstream tag 20250211 * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware") # upstream tag 20250109 # upstream tag 20241210 * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware") * 4a172771d ("amdgpu: update psp 14.0.1 firmware") * d316e650c ("amdgpu: update gc 11.5.1 firmware") # upstream tag 20241110 # upstream tag 20240811 * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware") # upstream tag 20240709 [ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:9 pasid:32771) [ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 thread redshiftCmdLine pid 3362) [ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 0x0000000000000000 from client 10 [ 217.270433] amdgpu 0000:c5:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431 [ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) (0xa) [ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1 [ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0 [ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3 [ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0 [ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0 [ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:9 pasid:32771) [ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 thread redshiftCmdLine pid 3362) [ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 0x0000000000000000 from client 10 [ 217.270454] amdgpu 0000:c5:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000 [ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0) [ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0 [ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0 [ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0 [ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0 [ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0 [ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:9 pasid:32771) [ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid 3362 thread redshiftCmdLine pid 3362) [ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address 0x0000000000000000 from client 10 [ 217.270470] amdgpu 0000:c5:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000 [ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0) [ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0 [ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0 [ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0 [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0 [ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0 - --- + --- ProblemType: Bug ApportVersion: 2.28.1-0ubuntu3.3 Architecture: amd64 CRDA: N/A CasperMD5CheckResult: pass Dependencies: firmware-sof-signed 2023.12.1-1ubuntu1.4 DistroRelease: Ubuntu 24.04 InstallationDate: Installed on 2024-05-07 (308 days ago) InstallationMedia: Ubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424) IwConfig: - lo no wireless extensions. - - enp193s0f0 no wireless extensions. + lo no wireless extensions. + + enp193s0f0 no wireless extensions. MachineType: AMD MAPLE Package: linux-firmware PackageArchitecture: amd64 ProcFB: 0 amdgpudrmfb ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.11.0-1016-oem root=UUID=ff988e57-9bf4-46d6-94cf-1e35d0139e12 ro amdgpu.ip_block_mask=0xfffffcff quiet splash vt.handoff=7 ProcVersionSignature: Ubuntu 6.11.0-1016.16-oem 6.11.11 RelatedPackageVersions: - linux-restricted-modules-6.11.0-1016-oem N/A - linux-backports-modules-6.11.0-1016-oem N/A - linux-firmware 20240318.git3b128b60-0ubuntu2.10 + linux-restricted-modules-6.11.0-1016-oem N/A + linux-backports-modules-6.11.0-1016-oem N/A + linux-firmware 20240318.git3b128b60-0ubuntu2.10 RfKill: - + Tags: noble package-from-proposed third-party-packages Uname: Linux 6.11.0-1016-oem x86_64 UnreportableReason: This does not seem to be an official Ubuntu package. Please retry after updating the indexes of available packages, if that does not work then remove related third party packages and try again. UpgradeStatus: No upgrade log present (probably fresh install) UserGroups: N/A _MarkForUpload: True dmi.bios.date: 04/25/2024 21:25:16 dmi.bios.release: 0.0 dmi.bios.vendor: AMD dmi.bios.version: RG60061C dmi.board.asset.tag: Base Board Asset Tag dmi.board.name: MAPLE-STXH dmi.board.vendor: AMD dmi.board.version: RevB dmi.chassis.asset.tag: Chassis Asset Tag dmi.chassis.type: 10 dmi.chassis.vendor: AMD dmi.chassis.version: 12345 dmi.ec.firmware.release: 0.23 dmi.modalias: dmi:bvnAMD:bvrRG60061C:bd04/25/2024212516:br0.0:efr0.23:svnAMD:pnMAPLE:pvrRG60061C:rvnAMD:rnMAPLE-STXH:rvrRevB:cvnAMD:ct10:cvr12345:sku12345678: dmi.product.family: STXH dmi.product.name: MAPLE dmi.product.sku: 12345678 dmi.product.version: RG60061C dmi.sys.vendor: AMD
** Changed in: linux-firmware (Ubuntu Oracular) Status: New => In Progress ** Changed in: linux-firmware (Ubuntu Noble) Status: New => In Progress ** Changed in: linux-firmware (Ubuntu Noble) Importance: Undecided => High ** Changed in: linux-firmware (Ubuntu Noble) Assignee: (unassigned) => You-Sheng Yang (vicamo) ** Changed in: linux-firmware (Ubuntu Oracular) Assignee: (unassigned) => You-Sheng Yang (vicamo) ** Changed in: linux-firmware (Ubuntu Oracular) Importance: Undecided => High -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2100769 Title: Update amdgpu FW for GC 11.5.1 To manage notifications about this bug go to: https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs