Muhammad Usama Anjun's recent series for being able to cancel the hibernate sequence [1] exposes a bug with amdgpu handling for skipping the thaw step.
Because the thaw step is skipped in most cases, cancelling the hibernate means that the device is left in an inconsistent state. To fix this allow devices to return -EBUSY and adjust the PM core handling of this case. Link: https://lore.kernel.org/all/[email protected]/ [1] Cc: Muhammad Usama Anjum <[email protected]> Mario Limonciello (3): PM: Mark device as suspended if it failed to resume PM: Don't pass up device_resume() -EBUSY errors drm/amd: Return -EBUSY for amdgpu_pmops_thaw() on success drivers/base/power/main.c | 7 ++++++- drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 2 +- 2 files changed, 7 insertions(+), 2 deletions(-) -- 2.43.0
