Begin forwarded message:
Date: Mon, 17 Jul 2017 08:34:16 +0000
From: bugzilla-dae...@bugzilla.kernel.org
To: step...@networkplumber.org
Subject: [Bug 196399] New: WARNING at net/sched/sch_generic.c:316 dev_watchdog[...] when suspending

https://bugzilla.kernel.org/show_bug.cgi?id=196399

Bug ID: 196399
Summary: WARNING at net/sched/sch_generic.c:316 dev_watchdog[...] when suspending
Product: Networking
Version: 2.5
Kernel Version: 4.11+
Hardware: All
OS: Linux
Tree: Mainline
Status: NEW
Severity: normal
Priority: P1
Component: Other
Assignee: step...@networkplumber.org
Reporter: martin.pe...@free.fr
Regression: No

Hello,

We have found out that since at least 4.11-rc1, some machines in the Intel GFX CI lab have been generating the following warning when suspending to s4 (suspend to disk):

[ 287.212825] ------------[ cut here ]------------
[ 287.212829] WARNING: CPU: 0 PID: 3165 at net/sched/sch_generic.c:316 dev_watchdog+0x218/0x220
[ 287.212830] Modules linked in: mcs7830 usbnet mii snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic i915 x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul snd_hda_intel snd_hda_codec snd_hwdep ghash_clmulni_intel snd_hda_core snd_pcm i2c_designware_platform i2c_designware_core mei_me mei prime_numbers i2c_hid pinctrl_sunrisepoint pinctrl_intel
[ 287.212864] CPU: 0 PID: 3165 Comm: gem_exec_suspen Tainted: G U 4.12.0-CI-CI_DRM_2829+ #1
[ 287.212865] Hardware name: Dell Inc. XPS 13 9360/093TW6, BIOS 1.3.2 01/18/2017
[ 287.212867] task: ffff8801b4084f40 task.stack: ffffc900001d8000
[ 287.212869] RIP: 0010:dev_watchdog+0x218/0x220
[ 287.212870] RSP: 0018:ffff88027e403e38 EFLAGS: 00010292
[ 287.212872] RAX: 000000000000005a RBX: 0000000000000000 RCX: 0000000000000000
[ 287.212874] RDX: 0000000000000002 RSI: ffffffff81cbcf89 RDI: ffffffff81c9c627
[ 287.212875] RBP: ffff88027e403e68 R08: 0000000000000000 R09: 0000000000000001
[ 287.212876] R10: 0000000028e9c215 R11: 0000000000000000 R12: ffff88026e08a848
[ 287.212877] R13: 0000000000000000 R14: ffff88026e050020 R15: 0000000000000001
[ 287.212878] FS: 00007f345056a8c0(0000) GS:ffff88027e400000(0000) knlGS:0000000000000000
[ 287.212880] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 287.212881] CR2: 00000000008d7008 CR3: 00000001b4314000 CR4: 00000000003406f0
[ 287.212882] Call Trace:
[ 287.212883] <IRQ>
[ 287.212886] ? qdisc_rcu_free+0x40/0x40
[ 287.212888] ? qdisc_rcu_free+0x40/0x40
[ 287.212891] call_timer_fn+0x8e/0x370
[ 287.212894] ? qdisc_rcu_free+0x40/0x40
[ 287.212896] expire_timers+0x150/0x1f0
[ 287.212899] run_timer_softirq+0x7c/0x160
[ 287.212903] __do_softirq+0x116/0x4a0
[ 287.212906] irq_exit+0xa9/0xc0
[ 287.212909] smp_apic_timer_interrupt+0x38/0x50
[ 287.212912] apic_timer_interrupt+0x90/0xa0
[ 287.212914] RIP: 0010:delay_tsc+0x33/0xc0
[ 287.212916] RSP: 0018:ffffc900001dbcd8 EFLAGS: 00000286 ORIG_RAX: ffffffffffffff10
[ 287.212918] RAX: 0000000080000000 RBX: 00000005964f23a0 RCX: 0000000000000001
[ 287.212919] RDX: 0000000080000001 RSI: ffffffff81c8e23a RDI: 00000000ffffffff
[ 287.212920] RBP: ffffc900001dbcf8 R08: 0000000000000000 R09: 0000000000000001
[ 287.212921] R10: 0000000000000000 R11: 0000000000000000 R12: 000000059633478e
[ 287.212922] R13: 0000000000249f13 R14: 0000000000000000 R15: ffff880272eac008
[ 287.212924] </IRQ>
[ 287.212929] ? delay_tsc+0x6b/0xc0
[ 287.212932] __delay+0xa/0x10
[ 287.212934] __const_udelay+0x31/0x40
[ 287.212936] hibernation_debug_sleep+0x20/0x30
[ 287.212938] hibernation_snapshot+0x2bc/0x5f0
[ 287.212940] hibernate+0x159/0x2f0
[ 287.212943] state_store+0xe0/0xf0
[ 287.212947] kobj_attr_store+0xf/0x20
[ 287.212949] sysfs_kf_write+0x40/0x50
[ 287.212951] kernfs_fop_write+0x130/0x1b0
[ 287.212955] __vfs_write+0x23/0x120
[ 287.212957] ? rcu_read_lock_sched_held+0x75/0x80
[ 287.212959] ? rcu_sync_lockdep_assert+0x2a/0x50
[ 287.212961] ? __sb_start_write+0xfa/0x1f0
[ 287.212964] vfs_write+0xc5/0x1d0
[ 287.212966] ? trace_hardirqs_on_caller+0xe7/0x1c0
[ 287.212969] SyS_write+0x44/0xb0
[ 287.212972] entry_SYSCALL_64_fastpath+0x1c/0xb1
[ 287.212973] RIP: 0033:0x7f344ed4a4a0
[ 287.212974] RSP: 002b:00007ffef50dfaa8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 287.212977] RAX: ffffffffffffffda RBX: ffffffff81470683 RCX: 00007f344ed4a4a0
[ 287.212978] RDX: 0000000000000004 RSI: 000000000041d211 RDI: 0000000000000006
[ 287.212979] RBP: ffffc900001dbf88 R08: 00000000008d6a50 R09: 0000000000000000
[ 287.212980] R10: 0000000000000000 R11: 0000000000000246 R12: 000000000041d211
[ 287.212981] R13: 0000000000000006 R14: 0000000000000000 R15: 0000000000000000
[ 287.212984] ? __this_cpu_preempt_check+0x13/0x20
[ 287.212988] Code: 63 8e 18 04 00 00 eb 93 4c 89 f7 c6 05 77 5c 77 00 01 e8 dc 7f fd ff 89 d9 48 89 c2 4c 89 f6 48 c7 c7 18 f4 cf 81 e8 f1 c4 9d ff <0f> ff eb c3 0f 1f 40 00 48 c7 47 08 00 00 00 00 55 48 c7 07 00
[ 287.213051] ---[ end trace b6016dcc7544a681 ]---

This is caught while running the intel-gpu-tools test named 'igt@gem_exec_suspend@basic-s4-devices' on the following machines:

- Intel Kaby Lake-R RVP: Failure rate 123/135 run(s) (91%), last occurrence: https://intel-gfx-ci.01.org/CI/CI_DRM_2828/fi-kbl-r/igt@gem_exec_susp...@basic-s4-devices.html
- Intel Kaby Lake i7-7560u: Failure rate 196/305 run(s) (64%), last occurrence: https://intel-gfx-ci.01.org/CI/CI_DRM_2827/fi-kbl-7560u/igt@gem_exec_susp...@basic-s4-devices.html
- Intel Skylake i7-6600u: Failure rate 23/75 run(s) (30%), last occurrence: https://intel-gfx-ci.01.org/CI/CI_DRM_2824/fi-skl-6600u/igt@gem_exec_susp...@basic-s4-devices.html
- Intel Sandy Bridge i7-2600: Failure rate 10/293 run(s) (3%), last occurrence: https://intel-gfx-ci.01.org/CI/CI_DRM_2816/fi-snb-2600/igt@gem_exec_susp...@basic-s4-devices.html

We have plenty of other machines that do not trigger this warning at all.

The bug used to live in fd.o's bugzilla, but it had no business being there: https://bugs.freedesktop.org/show_bug.cgi?id=100125

Let me know if I can help in some way.

--
You are receiving this mail because:
You are the assignee for the bug.
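For triage across many CI logs, the header line of the warning can be picked apart mechanically. The following is a minimal Python sketch, assuming the header keeps the shape seen in the log above ("WARNING: CPU: N PID: N at file:line function+offset"). The name parse_warning and the regular expression are illustrative only and are not part of any kernel or CI tooling.

```python
import re

# Header line of the warning, copied from the log in this report.
LINE = ("[ 287.212829] WARNING: CPU: 0 PID: 3165 at "
        "net/sched/sch_generic.c:316 dev_watchdog+0x218/0x220")

# Assumed shape of the header; other kernel versions may print it differently.
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>\S+):(?P<line>\d+) (?P<func>\w+)\+"
)


def parse_warning(line):
    """Return the warning site as a dict, or None if the line does not match."""
    m = WARNING_RE.search(line)
    if m is None:
        return None
    return {
        "cpu": int(m.group("cpu")),
        "pid": int(m.group("pid")),
        "file": m.group("file"),
        "line": int(m.group("line")),
        "func": m.group("func"),
    }


if __name__ == "__main__":
    print(parse_warning(LINE))
```

Grouping results by the (file, line, func) triple would let occurrences from different machines be counted as the same warning, even though timestamps, CPU and PID differ between runs.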