On Thu, 19 Oct 2006 08:10:33 -0700 Shane <[EMAIL PROTECTED]> wrote: > Hello, > > I am experiencing an intermittent crash with a Gigabit > controler using the sky2 driver under load. Confirmed on > 2.6.19-rc2 but also present with 2.6.18. After the lockup, > the system works as normally but the sky2 interface no > longer processes traffic. Here's the printk output: > NETDEV WATCHDOG: eth0: transmit timed out > sky2 eth0: tx timeout > sky2 eth0: transmit ring 401 .. 378 report=403 done=403 > sky2 status report lost?
See below, most likely lost IRQ. > BUG: soft lockup detected on CPU#0! > [<781447d5>] [<78128ad6>] [<78113b91>] [<7810390b>] > [<782b2477>] [<f8986e43>] [<7825fc69>] [<7825fce5>] > [<78128a30>] [<78124eb1>] [<78124f49>] [<78113b96>] > [<7810390b>] [<78101265>] [<78101281>] [<78101d4f>] > [<7837578c>] [<783751e0>] ======================= > > And the controler: > 03:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E > Gigabit > Ethernet Controller (rev 22) > Subsystem: Giga-byte Technology Marvell 88E8053 Gigabit Ethernet > Control > ler (Gigabyte) > Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- > Step > ping- SERR- FastB2B- > Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- > <TAbort- > <MAbort- >SERR- <PERR- > Latency: 0, Cache Line Size: 32 bytes > Interrupt: pin A routed to IRQ 218 > Region 0: Memory at e8000000 (64-bit, non-prefetchable) [size=16K] > Region 2: I/O ports at 8000 [size=256] > [virtual] Expansion ROM at 88000000 [disabled] [size=128K] > Capabilities: [48] Power Management version 2 > Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA > PME(D0+,D1+,D2+,D3hot > +,D3cold+) > Status: D0 PME-Enable- DSel=0 DScale=1 PME- > Capabilities: [50] Vital Product Data > Capabilities: [5c] Message Signalled Interrupts: Mask- 64bit+ > Queue=0/1 > Enable+ > Address: 00000000fee0300c Data: 4142 > Capabilities: [e0] Express Legacy Endpoint IRQ 0 > Device: Supported: MaxPayload 128 bytes, PhantFunc 0, ExtTag- > Device: Latency L0s unlimited, L1 unlimited > Device: AtnBtn- AtnInd- PwrInd- > Device: Errors: Correctable- Non-Fatal- Fatal- Unsupported- > Device: RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- > Device: MaxPayload 128 bytes, MaxReadReq 512 bytes > Link: Supported Speed 2.5Gb/s, Width x1, ASPM L0s, Port 0 > Link: Latency L0s <256ns, L1 unlimited > Link: ASPM Disabled RCB 128 bytes CommClk- ExtSynch- > Link: Speed 2.5Gb/s, Width x1 > Capabilities: [100] Advanced Error Reporting > > The system is a dual core Conroe system running with 2gb of > ram and a memory split of 2gb/2gb. Kernel preemption is > voluntary. I can disable preempt but it may take a day or > two for the lockup to show up again. > > Also, the mtu on this iface is set to 9k btu and the lockup > seems more frequent at a good network load. > > Shane > 1. What is the interrupt usage: cat /proc/interrupts 2. Try with the workaround for lost IRQ's modprobe sky2 idle_timeout=100 - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html