On 29/04/18 18:58, Stuart Henderson wrote:
> On 2018-04-29, Stuart Longland <[email protected]> wrote:
>> The rack has 5 servers, a ARM-based PC and the switch, all of which run
>> from a pair of 12V 105Ah AGM batteries, charged from mains power and
>> solar.  Switch is a Linksys LGS326-AU.  No other devices plugged into
>> this switch have connectivity issues.
>>
>> The port the industrial PC is connected to is a plain access port with
>> no VLAN tagging, trunking or other funny stuff (although all of the
>> above get used elsewhere in the network).
>>
>> When the link drops out, there's nothing in `dmesg`.  If I tether my
>> phone and hit the machine via SSH, I find it is unable to ping anything
>> on the internal network via IPv4 or IPv6, or vice versa.
> 
> What does "ifconfig rl0" show, both normally and when this happens?
> 
> How about "netstat -nI rl0"?

Not sure about netstat; but ifconfig rl0 isn't any different during and
outside of inaccessibility events.  Right now, I'm getting this:

> # ifconfig rl0                                                                
>                                           
> rl0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
>         lladdr 00:d0:c9:e0:f4:75
>         index 1 priority 0 llprio 3
>         media: Ethernet autoselect (100baseTX full-duplex)
>         status: active
>         inet 172.31.249.254 netmask 0xffffff00 broadcast 172.31.249.255
>         inet6 fe80::2d0:c9ff:fee0:f475%rl0 prefixlen 64 scopeid 0x1
>         inet6 2001:44b8:21ac:70f9::fe prefixlen 64> # netstat -nI rl0
> Name    Mtu   Network     Address              Ipkts Ierrs    Opkts Oerrs 
> Colls
> rl0     1500  <Link>      00:d0:c9:e0:f4:75 13402102     0 16803928     0     > 0
> rl0     1500  172.31.249/ 172.31.249.254    13402102     0 16803928     0     > 0
> rl0     1500  fe80::%rl0/ fe80::2d0:c9ff:fe 13402102     0 16803928     0     > 0
> rl0     1500  2001:44b8:2 2001:44b8:21ac:70 13402102     0 16803928     0     > 0

I'll try both commands again when I get problems.

> Is anything logged on the switch? This model has things like broadcast
> storm control, I'm wondering if that might have triggered or if it shows
> anything else useful.

I'll have a closer look, there were some messages about the link going
up and down, but it seems I missed turning the SNTP client on (settings
in two places) so the log timestamps are years off.  I've reset the logs
and will see how we go.

> Can you try a different cable, can you try a different switch port?

Unfortunately all ports are full.  I did replace the cable though.

>> Is there some sort of debugging flag I can turn on in the kernel to log
>> more detail about what's going on with rl0 when the loss of connectivity
>> is being experienced?
> 
> I don't see any extra debugging that can be enabled for rl.
> 

No problems, well, I'll keep an eye on the switch and see what it tells me.
-- 
Stuart Longland (aka Redhatter, VK4MSL)

I haven't lost my mind...
  ...it's backed up on a tape somewhere.

Reply via email to