Re: [Bloat] queuebloat

Jim Gettys Wed, 13 Apr 2011 10:22:17 -0700

On 04/13/2011 12:29 PM, Bob Briscoe wrote:

Jim,
By the end I think I had already addressed a lot of the concerns youstated at the start of the mail:
- Yes, the name of this exercise is water under the bridge.
- Buffers still have to be reasonably sized (my footnote covered thatalready)
However, three responses inline (prefixed "BB:")...

At 15:30 13/04/2011, Jim Gettys wrote:
On 04/13/2011 07:19 AM, Bob Briscoe wrote:
The problem is actually queuebloat, not bufferbloat. The buffer isthe memory set aside for the queue. The queue is how much of thememory is used to store packets or frames.
I think you are picking nits on the naming, though if you'd had thesuggestion last fall, I might have gone for it
BB: As I said, I'm picking nits on the naming, not suggesting itshould be changed at this stage.
But having a misleading name does make the nuancing harder - there's alot of practitioners out there who don't need or want to understandanything - they have no idea about why they should do things - theyjust put together strings of feature buzz-words. That's how most ofthe industry works.
It only needs some researcher with only a partial grasp of the issueto pick up the word bufferbloat as the new sexy research fashion, thenpublish their research results showing that smaller buffers will makethings worse. Then we have to start explaining we didn't really meanbufferbloat, yada yada, and it starts to make us look like we mightnot have known what we were talking about. While our researcher friendwith half a brain starts running around crowing that his marvellousnew research has proved us wrong,... when all he's actually done isproved that the word we chose as a name was not quite precise enough.

Heh. We didn't have any term for this at all. I went back and lookedat the discussion in end-to-end interest when Dave Reed reported 3gbufferbloat, and the suggested alternatives were worse, and no consensusreached.

And there are buffers that hide in systems that are not packetqueues, that people also should be awareof (e.g. encryption buffers, error correction buffers, buffers inapplications used for pipelining, etc).
BB: Good point. I guess your point again is that many of these buffersare not anything like as parsimoniously sized as they should be.

Yes, often they are infinite and dynamically allocated (e.g. the eventqueue inside of GUI applications and/or window systems themselves).

But these buffers are harder to cut down below a certain minimum,because they actually serve a function. There's no magic like AQM thatcan keep these buffers unoccupied most of the time.

Sometimes yes, sometimes no. Often, as in packet queues, the buffersfill because flow control from lower layers of buffering/queuing havefilled, and the software is not designed to elide unneeded operationswhen they can't keep up (again, causing buffers/queues to form justbefore the bottleneck).

I'm happy to also use a term queuebloat in places where it isapplicable, where you have packet queues... But bufferbloat a genericphenomena in communications programming, whether in network transports,or in applications using them. I guess in this I'm an odd-ball, havingmostly been a programmer who designed network based application. Let megive a concrete example:

Oh, and I forgot about socket buffers, which on modern OS's may alsoautomatically resize; these are not queues either. Even worse, is thatthey will resize based on the underlying confusion induced by otherbufferbloat/queuebloat underneath them. These can be controlled byapplications setting the socket buffer size, rather than taking defaultbehaviour. Again, at least for stream based protocols such as X, thesearen't yet queues (though we then parse the stream, and generate a queueof X events).

A good (recent) example I've seen is in OpenOffice, which has hadterrible behaviour on its slide arranging operations on Linux for years,not understanding it should discard unneeded mouse motion events (seemsto be one of the things the LibreOffice guys may have fixed, thankfully;I talked to Michael Meeks about this a while back). Bufferbloat affectsapplications just as much as network stacks.

So I'm not convinced that queuebloat is a better term, as it is lessgeneral than the phenomena I was tryingto describe. In any case, I think it's water under the dam at thisdate.
We don't want vendors to (necessarily*) reduce the size of thebuffer, we want them to reduce the size of the standing queue. Theycan do that with active queue management (AQM) (if we only knew howto code it robustly). Ideally with ECN too, but AQM would be a goodstart.
Some of these buffers are truly bloated, and/or not sized evenapproximately related to the bandwidth available (e.g. the 1.2seconds of buffering I observed on my DOCSIS3 modem, or similarhorror stories in DSL), or the 1000 packet transmit queue in Linux.These buffers are often sized by all the memory that is available,and the hardware vendors can't get small enough chips to "correctly"size them, (as though we knew what the bandwidth was, or the delaywas, one of the mythologies that got us into this mess).
One of the first steps (well short of the nirvana of AQM), is to atleast get the buffers sized to something sane, and related to thebandwidth the hardware is being operated at. And as each generationof new kit is built (and often as a market requirement has to pluginto downward compatible hardware), it's been getting worse.
This is what the cable folks are in the middle of doing; it'sobviously safe to at least have the buffer sizes approximatelyproportional to the bandwidth at which the device is operating(similarly for the Linux transmit queue; if you are at 100Mbps, youcan cut the size by a factor of 10 without any danger). With theability to go hundreds of megabits/second but most customers payingfor 10-20Mbps, it is pretty obvious the buffer size had better berelated to the bandwidth of operation, and never be a static buffersized for the worst case.
Let's not lose sight of immediate, safe mitigations that are at hand,while working on AQM with or without ECN, though that is the onlyreal, long term solution.
BB: The two stage fix might work for some types of product, wherecontinual fixes are the norm. But in other types of product, each fixinvolves an engineer visit and a box swap out, which you don't want tobe doing more than once if you can help it.

Yup. Would that we had AQM's that we knew worked in the face of highlyvariable bandwidth and workloads we could just recommend everyone gouse: but we're not there yet.

At best, we have some not yet tested ideas and are still getting set upto try to run even simple tests (e.g. SFB, RED light when we can get ourhands on it).

And we certainly *want* operators who could/should be running REDalready to turn it on in places where it can be used. My point isprimarily that the enemy of the good is the perfect, and steps we cantake to make the problem less severe while working on AQM that canhandle the current edge are well worth taking. Sometimes those stepsmay make the problem 1/10th the size it is today. That doesn't get uswhere we ought to go, but it will reduce suffering.

            - jim

_______________________________________________
Bloat mailing list
[email protected]
https://lists.bufferbloat.net/listinfo/bloat

Re: [Bloat] queuebloat

Reply via email to