After running everything with AMD_DEBUG=nongg, the "Waiting for fences"
errors seem to be mostly gone, and I now get sdma0 timeouts. So this
seems to be part of the general cluster of failures that has plagued
the Linux Navi drivers since the beginning.

Jun 22 07:24:43 alhazen kernel: [  748.740480] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=134116, emitted 
seq=134118
Jun 22 07:24:43 alhazen kernel: [  748.740549] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* Process information: process Xorg pid 3589 thread Xorg:cs0 
pid 3591
Jun 22 07:24:43 alhazen kernel: [  748.740552] [drm] GPU recovery disabled.
Jun 22 07:25:28 alhazen kernel: [  794.386634] GpuWatchdog[5797]: segfault at 0 
ip 0000556cdabbccb9 sp 00007f6a540a06c0 error 6 in chrome[556cd6a4e000+7095000]
Jun 22 07:25:28 alhazen kernel: [  794.386642] Code: 00 79 09 48 8b 7d c0 e8 d5 
14 2b fc c7 45 c0 aa aa aa aa 0f ae f0 41 8b 84 24 e0 00 00 00 89 45 c0 48 8d 
7d c0 e8 b7 31 e9 fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 
5d 41 5e
Jun 22 07:25:38 alhazen kernel: [  804.548718] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=29322, emitted seq=29326
Jun 22 07:25:38 alhazen kernel: [  804.548788] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jun 22 07:25:38 alhazen kernel: [  804.548791] [drm] GPU recovery disabled.
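
For anyone comparing logs across runs, here is a minimal sketch of how the
ring timeouts above could be pulled out of a saved log. It assumes the
message format shown in this report; the function name find_ring_timeouts
and the sample text are purely illustrative and not part of any amdgpu
tooling. The difference it prints is simply emitted seq minus signaled seq.

```python
import re

# Matches the payload of an amdgpu_job_timedout ring line, for example:
# "ring sdma0 timeout, signaled seq=29322, emitted seq=29326"
RING_TIMEOUT = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def find_ring_timeouts(log_text):
    """Return (ring, signaled, emitted) tuples for each ring timeout found."""
    # Mail clients hard-wrap long log lines, so collapse every run of
    # whitespace (including newlines) to a single space before matching.
    flat = " ".join(log_text.split())
    return [
        (m["ring"], int(m["signaled"]), int(m["emitted"]))
        for m in RING_TIMEOUT.finditer(flat)
    ]


# Sample text copied from the log above, including one wrapped line.
sample = """\
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=134116, emitted
seq=134118
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=29322, emitted seq=29326
"""

for ring, signaled, emitted in find_ring_timeouts(sample):
    print(f"{ring}: {emitted - signaled} fence(s) emitted but not signaled")
```

Feeding it the saved output of journalctl -k or dmesg instead of the sample
string should work the same way, as long as the message wording matches.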

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1883493

Title:
  amdgpu hangs from time to time with *ERROR* Waiting for fences timed
  out!

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Jun 15 08:30:42 alhazen kernel: [ 1566.155810] 
[drm:amdgpu_dm_commit_planes.constprop.0 [amdgpu]] *ERROR* Waiting for fences 
timed out!
  Jun 15 08:30:47 alhazen kernel: [ 1566.159792] 
[drm:amdgpu_dm_commit_planes.constprop.0 [amdgpu]] *ERROR* Waiting for fences 
timed out!
  Jun 15 08:30:47 alhazen kernel: [ 1571.020144] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=535493, emitted 
seq=535495
  Jun 15 08:30:47 alhazen kernel: [ 1571.020216] [drm:amdgpu_job_timedout 
[amdgpu]] *ERROR* Process information: process Xorg pid 3664 thread Xorg:cs0 
pid 3694
  Jun 15 08:30:47 alhazen kernel: [ 1571.020218] [drm] GPU recovery disabled.

  Mouse pointer still moves, but apart from that the display is frozen.
  Music keeps playing.

  ProblemType: Bug
  DistroRelease: Ubuntu 20.04
  Package: linux-image-5.4.0-37-generic 5.4.0-37.41
  ProcVersionSignature: Ubuntu 5.4.0-37.41-generic 5.4.41
  Uname: Linux 5.4.0-37-generic x86_64
  ApportVersion: 2.20.11-0ubuntu27.2
  Architecture: amd64
  CasperMD5CheckResult: skip
  CurrentDesktop: ubuntu:GNOME
  Date: Mon Jun 15 09:09:56 2020
  InstallationDate: Installed on 2020-05-28 (17 days ago)
  InstallationMedia: Ubuntu 20.04 LTS "Focal Fossa" - Release amd64 (20200423)
  IwConfig:
   enp4s0    no wireless extensions.
   
   lo        no wireless extensions.
  MachineType: System manufacturer System Product Name
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.4.0-37-generic 
root=/dev/mapper/vgubuntu-root ro quiet splash acpi-enforce-resources=lax 
vt.handoff=7
  RelatedPackageVersions:
   linux-restricted-modules-5.4.0-37-generic N/A
   linux-backports-modules-5.4.0-37-generic  N/A
   linux-firmware                            1.187
  RfKill:
   
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 07/02/2019
  dmi.bios.vendor: American Megatrends Inc.
  dmi.bios.version: 0604
  dmi.board.asset.tag: Default string
  dmi.board.name: PRIME X570-PRO
  dmi.board.vendor: ASUSTeK COMPUTER INC.
  dmi.board.version: Rev X.0x
  dmi.chassis.asset.tag: Default string
  dmi.chassis.type: 3
  dmi.chassis.vendor: Default string
  dmi.chassis.version: Default string
  dmi.modalias: 
dmi:bvnAmericanMegatrendsInc.:bvr0604:bd07/02/2019:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnPRIMEX570-PRO:rvrRevX.0x:cvnDefaultstring:ct3:cvrDefaultstring:
  dmi.product.family: To be filled by O.E.M.
  dmi.product.name: System Product Name
  dmi.product.sku: SKU
  dmi.product.version: System Version
  dmi.sys.vendor: System manufacturer
