On Thu, 2015-08-20 at 21:37 +0200, Tomas Papan wrote:
> Hi there,
> 
> I’m observing a freeze with the recent kernel (4.2-rc7). Unfortunately I 
> can’t preserver the full traces.
> There is nothing in the messages after reboot, I was just lucky one time to 
> see it when tail -f /var/log/messages was running.
> This is the only line which I was able to get:
>       eth1 (e1000e): transmit queue 0 timed out
> 
> I’ve got this message in the past, but the ethtool -K eth1 tso off solved 
> that. I’m always running
> this command at the boot time since then. There is no issue with 4.2-rc4. It 
> is hard to bisect,
> because this machine is used as headless server and it happens randomly 
> (usually within 2 hours).
> 
> Do you have any idea how to trace it or what can I do?
> Please keep me on CC since I’m not subscribed on this list
> 
> Regards
> Tomas

I would pull latest tree from Linus and pray the bug was fixed.

My feeling is that you hit the issue fixed with

commit 83fccfc3940c4a2db90fd7e7079f5b465cd8c6af
Author: Eric Dumazet <eduma...@google.com>
Date:   Thu Aug 13 15:44:51 2015 -0700

    inet: fix potential deadlock in reqsk_queue_unlink()
    
    When replacing del_timer() with del_timer_sync(), I introduced
    a deadlock condition :
    
    reqsk_queue_unlink() is called from inet_csk_reqsk_queue_drop()
    
    inet_csk_reqsk_queue_drop() can be called from many contexts,
    one being the timer handler itself (reqsk_timer_handler()).
    
    In this case, del_timer_sync() loops forever.
    
    Simple fix is to test if timer is pending.
    
    Fixes: 2235f2ac75fd ("inet: fix races with reqsk timers")
    Signed-off-by: Eric Dumazet <eduma...@google.com>
    Signed-off-by: David S. Miller <da...@davemloft.net>


--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to