Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-16 Thread Genes Lists via arch-general
On 2/16/22 16:28, David Rosenstrauch via arch-general wrote: Well it's great to hear you've got a decent explanation - and consistent with the cpu hangs you noted. good luck getting everything fully operational. gene

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-16 Thread David Rosenstrauch via arch-general
On 2/11/22 11:44 AM, David Rosenstrauch via arch-general wrote: On 2/11/22 9:21 AM, Genes Lists via arch-general wrote: Also it may be worthwhile running memcheck to be sure your memory is not faulty. Yeah that thought occurred to me as well.  I ran a quick memtest86+ when I first built th

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-12 Thread Uwe Sauter via arch-general
sorry i know this kind of thing is frustrating to deal with. Big time.  I've gotten very good over the years at diagnosing and fixing issues using log messages.  But sudden catastrophic crashes like this that don't leave any trace in the logs/journal are *really* hard to pin down. Did you a

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-12 Thread Ralph Corderoy via arch-general
Hi David, Looking at your http://darose.net/ServerCrash20220209.png, are you aware of https://www.kernel.org/doc/Documentation/RCU/stallwarn.txt which has detail on what it means? Though it looks to me like at least one line of output has been trampled. Also, one Google'd suggestion was a real-t

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread David Rosenstrauch via arch-general
Thanks much for following up! Responses inline. On 2/11/22 4:15 PM, Genes Lists via arch-general wrote: I suppose it could also be southbridge being annoying It's a very new machine (Rocket Lake and PCIE4) so doesn't technically use the traditional northbridge/southbridge model. But point

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread Genes Lists via arch-general
I suppose it could also be southbridge being annoying - did you check temps on the mobo are reasonable to be sure of adequate cooling? Are you overclocking at all by chance? sorry i know this kind of thing is frustrating to deal with.

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread David Rosenstrauch via arch-general
On 2/11/22 9:56 AM, Genes Lists via arch-general wrote: Another thought - if you can try a different network hardware that might be useful as well. And to be clear, this kernel is running on physical hardware not a VM right? If you're running on VM please share which host and VM is used.

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread David Rosenstrauch via arch-general
On 2/11/22 9:21 AM, Genes Lists via arch-general wrote: On 2/10/22 18:32, David Rosenstrauch via arch-general wrote: ... ... "rcu_preempt detected stalls on cpus/tasks".  But searching on that Since the CPU is stalled and unable to make further progress something is inhibiting it. One thou

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread Genes Lists via arch-general
Another thought - if you can try a different network hardware that might be useful as well. And to be clear, this kernel is running on physical hardware not a VM right? If you're running on VM please share which host and VM is used. thanks.

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-11 Thread Genes Lists via arch-general
On 2/10/22 18:32, David Rosenstrauch via arch-general wrote: ... ... "rcu_preempt detected stalls on cpus/tasks".  But searching on that Since the CPU is stalled and unable to make further progress something is inhibiting it. One thought - I wonder if the CPU is waiting for memory IO - in whic

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-10 Thread David Rosenstrauch via arch-general
Thanks much for the response. I'm not really clear what specifically that might mean though: A buffer in the kernel? In an application? And how might I debug that and try to prevent it from happening again? Thanks, DR On 2/10/22 6:44 PM, mick howe via arch-general wrote: seems so obvious

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-10 Thread mick howe via arch-general
seems so obvious you already illuminated it but some buffer not clearing old data? mick in glen innes 2370 On Fri, 11 Feb 2022 at 09:32, David Rosenstrauch via arch-general wrote: > > Following up on this crash issue I keep having with my Arch server. > Basically server just completely freezes u

Re: [arch-general] Linux server crash causing router switch to stop working

2022-02-10 Thread David Rosenstrauch via arch-general
Following up on this crash issue I keep having with my Arch server. Basically server just completely freezes up - doesn't respond to pings, or keyboard/mouse input, and eventually has to just be rebooted. Good news is: a) after upgrading everything (including the router firmware) it no longer

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-24 Thread David Rosenstrauch via arch-general
On 1/24/22 2:47 AM, Uwe Sauter via arch-general wrote: Does your server support IPMI and thus SOL (serial over LAN)? Thanks for the suggestion. Unfortunately this is just a home/desktop server, so I don't think its mobo supports IPMI. Thanks, DR

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-24 Thread Andy Pieters via arch-general
On Mon, 24 Jan 2022 at 15:50, Genes Lists via arch-general < arch-general@lists.archlinux.org> wrote: > > > It's hard to know for sure exactly what's happening, as the server is > > headless. It does have a KVM-over-IP hooked up to it, though. But when > > Perhaps you can login after server is b

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-24 Thread Genes Lists via arch-general
On 1/23/22 21:12, David Rosenstrauch via arch-general wrote: Tnx much for the reply!  Responses inline below. ... It's hard to know for sure exactly what's happening, as the server is headless.  It does have a KVM-over-IP hooked up to it, though.  But when Perhaps you can login after serv

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-23 Thread Uwe Sauter via arch-general
Hello David, >>  When you say server crashes - I assume kernel crashed and is not >> functioning (as opposed to >> kernel is up and running but network is not doing what I want)? Does server >> respond to any >> keyboard or mouse movement on console? > > It's hard to know for sure exactly w

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-23 Thread David Rosenstrauch via arch-general
On 1/21/22 5:08 PM, Łukasz Michalski via arch-general wrote: When it crashes, instead of reconnecting router connect any laptop with arch booted from usb. If link comes up then use tcpdump to see what is happening on the wire. That's a good suggestion - thanks. (No better way to see if the

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-23 Thread David Rosenstrauch via arch-general
Tnx much for the reply! Responses inline below. On 1/21/22 2:29 PM, Genes Lists via arch-general wrote: When you say server crashes - I assume kernel crashed and is not functioning (as opposed to kernel is up and running but network is not doing what I want)? Does server respond to any k

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-21 Thread Łukasz Michalski via arch-general
On 1/21/22 18:54, David Rosenstrauch via arch-general wrote: I figured the issue was a problem with my wifi router, but after much debugging I don't think that's the case.  I have a new (Arch) linux server I built a few weeks ago.  What seems to be happening is that my server crashes for some r

Re: [arch-general] Linux server crash causing router switch to stop working

2022-01-21 Thread Genes Lists via arch-general
On 1/21/22 12:54, David Rosenstrauch via arch-general wrote: Been experiencing a weird issue several times recently that's got me stumped. Hi David Sounds a bit tricky to sort out - not sure can help but maybe you can help us understand your set up a bit more. When you say server crashe

[arch-general] Linux server crash causing router switch to stop working

2022-01-21 Thread David Rosenstrauch via arch-general
Been experiencing a weird issue several times recently that's got me stumped. A couple of weeks ago, my entire home network went dead right in the middle of a zoom call. Same problem happened again late last night. The problem is intermittent/occasional: everything runs without issue for se