Hi,
On Thu, 18 Nov 2010 16:38:55 +0100
Manuel Guesdon <[email protected]> wrote:
>| Is there a way to get detailed em(4) device errors without having to
>| recompile kernel with EM_DEBUG ?
>| I try to find in-errors reason(s) but netstat only gives errors as a sum of
>| dropped_pkts + stats.rxerrc + stats.crcerrs + sc->stats.algnerrc +... as far
>| as I can see :-(
I took me some time to upgrade to 4.8 version and modify kernel to get detail
info on demand.
I still have the problem on multiple servers (but with very similar hardware
and software).
em4 at pci11 dev 0 function 0 "Intel PRO/1000 (82576)" rev 0x01: apic 9 int 15
(irq 15), address 00:25:90:05:53:3e
em4: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
lladdr 00:25:90:05:53:3e
description: br1.th2
priority: 0
groups: pf-off
media: Ethernet autoselect (1000baseT
full-duplex,master,rxpause,txpause)
status: active
inet6 fe80::225:90ff:fe05:533e%em4 prefixlen 64 scopeid 0x5
#netstat -I em4 -d
Name Mtu Network Address Ipkts Ierrs Opkts
Oerrs Colls Drop
em4 1500 <Link> 00:25:90:05:53:3e 8936976317 4614835 5430820423
0 0 0
em4 1500 fe80::%em4/ fe80::225:90ff:fe 8936976317 4614835 5430820423
0 0 0
Detailed stats:
em4: Dropped PKTS = 0
em4: Excessive collisions = 0
em4: Symbol errors = 0
em4: Sequence errors = 0
em4: Defer count = 353
em4: Missed Packets = 4241586
em4: Receive No Buffers = 5297798
em4: Receive Length Errors = 0
em4: Receive errors = 0
em4: Crc errors = 0
em4: Alignment errors = 0
em4: Carrier extension errors = 0
em4: RX overruns = 372913
em4: watchdog timeouts = 0
em4: XON Rcvd = 3086
em4: XON Xmtd = 592675
em4: XOFF Rcvd = 164449
em4: XOFF Xmtd = 4833995
em4: Good Packets Rcvd = 8936940571
em4: Good Packets Xmtd = 5430798347
At this time, the interface carry around 56mbps inbound and 35Mbps outbound
Server load is 0.14
The em4 interface is connected to a an interface on another server with near
same config (but I get same kind of problem for interfaces connected to switch
with copper and fiber).
Errors seems a little related to interface load but not very closely.
Each servers have 2xQuad-ports cards (82576) + 2 ports on motherboard (82576
too).
I was thinking of problem with interrupt mitigation.
Any idea, comments, things to test ?
Thank you !
Manuel