[dropping Andrew, Jeff, and LKML]
On May 4, 2007, at 4:43 PM, David Acker wrote:
David Acker wrote:
So far my testing has shown that both the original and the new version of
the S-bit patch work, in that no corruption seemed to occur over
long-term runs.
I spoke too soon. Further testing has not gone well. If I use the
default settings for CPU saver and drop the receive pool down to 16
buffers I can cause problems with various forms of the patch. With
the original S-bit patch I can get:
...
The updated patch produced a different issue. We got an RNR interrupt
indicating the receive unit got ahead of the software. The S-bit
patch removed any handling of this case, as it assumed the hardware
would spin on the S-bit. Apparently, if both the S-bit and the EL-bit
are set on the same RFD, the hardware follows the EL-bit handling.
Printing the stat/ack and status bytes on the RNR interrupts I get:
status:   01001000 = 0x48 -> RUS = 0010 (No Resources), CUS = 01 (Suspended)
stat/ack: 01010000 = 0x50 -> FR, RNR
      or: 00010000 = 0x10 -> RNR
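For reference, here is a minimal, standalone sketch of how those two
bytes decode, assuming the SCB layout from that manual (CUS in bits 7:6
and RUS in bits 5:2 of the status byte; FR and RNR in the stat/ack
byte). The helper names are mine, not the driver's:

#include <stdio.h>

#define SCB_ACK_FR   0x40   /* frame received */
#define SCB_ACK_RNR  0x10   /* RU left the ready state */

/* RUS field is bits 5:2 of the SCB status byte (per the 8255x manual). */
static const char *rus_str(unsigned char status)
{
        switch ((status >> 2) & 0xf) {
        case 0x0: return "Idle";
        case 0x1: return "Suspended";
        case 0x2: return "No Resources";
        case 0x4: return "Ready";
        default:  return "Reserved";
        }
}

int main(void)
{
        unsigned char status = 0x48, stat_ack = 0x50;

        printf("RUS=%s CUS=%u%s%s\n",
               rus_str(status), (status >> 6) & 0x3,
               (stat_ack & SCB_ACK_FR)  ? " FR"  : "",
               (stat_ack & SCB_ACK_RNR) ? " RNR" : "");
        return 0;       /* prints: RUS=No Resources CUS=1 FR RNR */
}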
Notice that the RUS went into No Resources, not Suspended. Thus
clearing the S-bit does not wake it up; it needs a new start command.
I could not find documentation stating that the S-bit need only be
cleared to take the RU out of the suspended state. Before the S-bit
patch the driver tried to track this need, but that version of the
driver didn't work for me either. By the way, I am using "Intel 8255x
10/100 Mbps Ethernet Controller Family, Open Source Software Developer
Manual, January 2006" as my documentation.
This got me looking at just how in the world this worked on the old
eepro100 driver. It had another difference: it did not reap the last
rx buffer in the chain. It set a postponed bit and then picked that
buffer up on the next interrupt, after more buffers had been
allocated. It then noticed that the RU was in a suspended or
no-resources state and did a soft reset.
I don't believe this avoid-the-last-buffer trick really fixes the
race. Imagine the following:
1. 4 buffers in receive pool, all freshly allocated
2. Hardware consumes 3 buffers
3. Software processes 3 buffers, begins to allocate new buffers
4. Hardware writes status bits into buffer 4 while software updates
link and command word bits in buffer 4. They share a cache line and
corrupt each other.
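To make step 4 concrete, here is a sketch of the simplified-mode RFD
header as I read it from the manual (field names are mine, not the
driver's). The hardware-written status word and the software-written
command/link words sit in the same 16-byte header, so on a CPU with
32-byte cache lines like the PXA255 they share a cache line; a
writeback of the dirty line holding the command/link update can
overwrite the status the hardware just DMA'd in, and vice versa.

#include <linux/types.h>

struct rfd {
        __le16 status;       /* written by hardware when the frame completes */
        __le16 command;      /* EL/S bits, written by software               */
        __le32 link;         /* physical address of next RFD, software       */
        __le32 rbd_addr;     /* 0xffffffff in simplified mode                */
        __le16 actual_size;  /* received byte count, written by hardware     */
        __le16 size;         /* buffer size, written by software             */
        /* received frame data follows */
};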
This appears to be possible with any of the versions of this driver I
have seen. The problem is one of packet ownership. Once the driver
gives a list of buffers to hardware, hardware owns them all. The
driver cannot safely change these buffers. Sadly, this means that
the idea of the driver "staying ahead" of the hardware such that the
hardware never runs out of resources will not work here. Once the
driver gives the hardware a packet with S or EL bits set, it must let
the hardware encounter the packet and return it to software.
I think the driver needs to protect the last entry in the ring by
putting the S-bit on the entry before it. The first time the driver
allocates a block of packets, it writes a new S-bit out on the
next-to-last packet. As buffers complete it allocates more packets in
the chain but does not set a new S-bit, since the old one will stop
the hardware. It cannot clear the old S-bit because the driver does
not own the buffer; hardware does. After processing the S-bit packet
the hardware will interrupt with a stat/ack of RNR and a RUS of
Suspended. When software processes a packet with an old S-bit, it
allocates new buffers and sets the S-bit on the new next-to-last
packet.
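A rough sketch of that policy, in terms of hypothetical helpers
(rx_ring_next_complete(), rx_reap_and_realloc(), rx_ring_next_to_last(),
rx_ring_sync_for_device(), scb_ru_resume()) rather than the driver's
real functions:

#define RFD_S_BIT 0x4000        /* suspend bit in the RFD command word */

static void rx_process_and_refill(struct nic_ring *ring)
{
        struct rfd *rfd;
        bool hit_old_s_bit = false;

        /* Reap completed RFDs and queue freshly allocated replacements. */
        while ((rfd = rx_ring_next_complete(ring)) != NULL) {
                if (le16_to_cpu(rfd->command) & RFD_S_BIT)
                        hit_old_s_bit = true;   /* hardware is parked after this one */
                rx_reap_and_realloc(ring, rfd);
        }

        if (hit_old_s_bit) {
                /* Put the S-bit on the new next-to-last RFD so hardware
                 * stops there on the next lap.  The old S-bit is left
                 * alone; hardware still owns that descriptor. */
                rfd = rx_ring_next_to_last(ring);
                rfd->command |= cpu_to_le16(RFD_S_BIT);
                rx_ring_sync_for_device(ring);
                scb_ru_resume(ring->scb);       /* RU continues at the saved link */
        }
}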
With this scheme, the case above changes:
1. 4 buffers numbered 1-4 in a receive pool, all freshly allocated.
S-bit is on buffer 3.
2. Hardware consumes 3 buffers, hits S-bit, RNR interrupts
3. Software processes 3 buffers, begins to allocate new buffers
4. Software sends resume once buffers are allocated, S-bit is on
buffer 2.
5. Hardware gets resume. When it processed buffer 3, it saved the
link to buffer 4 and thus resumes at buffer 4.
Here is a different flow where the software stays ahead:
1. 4 buffers numbered 1-4 in a receive pool, all freshly allocated.
S-bit is on buffer 3.
2. Hardware consumes 2 buffers (1, 2).
3. Software processes buffers 1, 2, begins to allocate new buffers
4. New buffers 1 and 2 are allocated
5. Hardware consumes 1 buffer (#3) and hits S-bit, RNR interrupts.
6. Software consumes 1 buffer (#3) and finds the old S-bit. It
allocates a new buffer 3 and sets the S-bit on buffer 2.
7. Software sends resume (the SCB write is sketched below); hardware
continues at buffer 4.
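For completeness, here is my guess at what the "sends resume" step
looks like at the SCB; the RU Resume command value (2) and the
command-byte offset come from the manual, the names are mine, and this
is the scb_ru_resume() used in the earlier sketch:

#define SCB_CMD_LO      2       /* offset of the SCB command byte   */
#define RUC_RESUME      0x02    /* RU Resume, per the 8255x manual  */

static void scb_ru_resume(u8 __iomem *scb)
{
        /* Wait for the previous SCB command to be accepted (byte reads
         * 0) before issuing the RU Resume. */
        while (ioread8(scb + SCB_CMD_LO) != 0)
                cpu_relax();
        iowrite8(RUC_RESUME, scb + SCB_CMD_LO);
}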
In this setup, software will send a resume command every RING_SIZE
packets. RNR interrupts will also occur every RING_SIZE packets.
When hardware is faster than software, it will process RING_SIZE
packets, RNR interrupt and wait for software to process all of them.
When software is faster than hardware, hardware will still process
RING_SIZE packets before interrupting but software will only need to
allocate 1 packet or so before sending the resume so hardware will
wait much less time.
This will probably slow things down since on a fast CPU, software will
normally stay ahead of the hardware and the only PCI operations from
the driver would be interrupt acks. With this change, we have PCI
operations every 256 packets. I don't see how else to do this in a
safe way on ARM (at least PXA255).
I am testing this over the weekend with a 16-buffer receive pool. If
all goes well, I will send a patch early next week. It will basically
back out the S-bit patch and then make the changes noted above.
While this will help with the problem of the driver not working on
cache-incoherent DMA systems, it guarantees the hardware will stop
every <ring-size> packets and wait for the CPU to respond to an
interrupt. It would seem that this will lead to packet drops.
[download manual from site in source file]
In fact, 6.4.3.4 says the 82557 will start dropping frames immediately.
Looking at the descriptions around page 101:
(1) The link pointer, S, and EL are read when the hardware starts
receiving the frame.
(2) It's pretty clear from the order of the descriptions in the text
that EL overrides S.
(3) 6.4.3.3.1 #4 looks interesting -- that is, an RFD with size 0
skips the frame data fill and goes on to the next packet.
How about putting a zero-length descriptor in consistent memory to
suspend the rx unit before the last real frame? In other words, Fr0
-> Fr1 ... FrN-2 -> FrN-1 -> WaitHere0 -> FrN. We could then have 2
such frames, and when we refill, modify FrN to link to the new chain,
with WaitHere1 as its next-to-last, do the syncs, then clear the S
bit on WaitHere0. When the rx unit passes WaitHere0 we can reclaim it
for the next use (we might want a slightly larger pool; basically we
need RxRingSize / RxRingFillBatch such frames).
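A rough sketch of that refill path, reusing RFD_S_BIT from the earlier
sketch and assuming two zero-sized RFDs allocated from coherent
(pci_alloc_consistent) memory; all the other names are illustrative:

static void rx_refill_with_wait_here(struct nic *nic)
{
        int cur = nic->wait_cur;        /* WaitHere the RU is parked on */
        int nxt = cur ^ 1;
        struct rfd *wh_cur = nic->wait_here[cur];

        /* Build the new chain of real RFDs with WaitHere[nxt] (size 0,
         * S bit set) as its next-to-last entry, then point the old
         * chain's last real frame (FrN) at the new chain. */
        rx_build_new_chain(nic, nic->wait_here[nxt]);
        nic->frN->link = cpu_to_le32(nic->new_chain_dma);

        /* Flush the streaming-DMA descriptors out to memory first... */
        rx_sync_chain_for_device(nic);

        /* ...then release the hardware.  WaitHere[cur] is in coherent
         * memory, so clearing its S bit needs no cache maintenance and
         * cannot race with anything the hardware writes. */
        wh_cur->command &= ~cpu_to_le16(RFD_S_BIT);
        nic->wait_cur = nxt;
}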
milton