Kris & Mark,
My latest information about the problems with enabling HTTP pipelining
is about five years old, so it will take me time to do research and
provide more detailed information. That being said, the last time we
experimentally enabled HTTP pipelining in internal Safari builds, we
found the following:
1) Some web servers on the public Web had problems with pipelining.
2) Some proxy servers had problems with pipelining.
3) In at least some cases, these problems took the form of scrambling
together multiple replies in a way the HTTP library could not reliably
detect.
The upshot of #3 is that we couldn't even turn on pipelining and back
off if it failed: we would not have been able to tell, in all cases,
that it had failed, and the result would be silent corruption of user
data. We decided that even though this would happen only to a minority
of users, it was not worth the risk for a potential performance
optimization.
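To make failure mode #3 concrete, here is a minimal sketch (Python, purely illustrative, not our actual HTTP library) of how a client frames pipelined responses. The only delimiter between replies is each response's own Content-Length, so a single wrong length silently shifts every byte that follows; nothing raises an error.

```python
def split_responses(stream: bytes, count: int):
    """Split `count` pipelined HTTP responses out of one byte stream.

    Framing relies entirely on each response's Content-Length header;
    there is no other delimiter, so one wrong length desynchronizes
    every response that follows.
    """
    responses = []
    rest = stream
    for _ in range(count):
        head, _, rest = rest.partition(b"\r\n\r\n")
        length = 0
        for line in head.split(b"\r\n"):
            if line.lower().startswith(b"content-length:"):
                length = int(line.split(b":", 1)[1])
        responses.append((head, rest[:length]))
        rest = rest[length:]
    return responses

# Two pipelined responses; the first advertises length 7 for a 5-byte body.
stream = (b"HTTP/1.1 200 OK\r\nContent-Length: 7\r\n\r\nhello"
          b"HTTP/1.1 200 OK\r\nContent-Length: 6\r\n\r\nsecret")
bodies = [body for _, body in split_responses(stream, 2)]
# The first body swallows the start of the second status line, and the
# second response's status line is truncated, yet no step failed.
```

The client has no reliable way to notice the off-by-two; both "responses" parse cleanly.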
The upshot of #2 was that there wasn't even a good way to provide a
whitelist of sites where it was known to work, since we could not
predict what proxy a user would be running.
It may be that these problems are no longer common. However, I don't
think our position should be to assume there is no problem until
proven otherwise. Instead, we should investigate the current state of
these issues together. I will do my best to find whatever information
I can.
With this in mind, some specific replies to some of your comments:
On Feb 18, 2008, at 8:28 PM, Kris Zyp wrote:
>> Since the breakage is caused in at least some cases by proxies, it
>> is not in general safe to let XHR users opt in since they may
>> control the origin server but generally would not control whatever
>> proxies are between the server and the user.
>> Pipelining is a great potential performance improvement and it's
>> sad that it can't safely be used on the public Internet right now,
>> so I hope we someday find a way out of the impasse.
> There is a world of difference between browsers choosing to do
> pipelining, where no one gets to opt in, and XHR opt-in, where
> developers know the origin server, may even know the full route for
> all users (as in intranets), can make a calculated decision about
> where they want to try pipelining or not, and can even code backup
> solutions for when pipelined requests fail.
I disagree that there is a world of difference. Since many of the
problems are with proxies, it would still be unsafe to use pipelining
on a public site even when it is specifically requested.
> It seems very presumptuous to tell developers that they can't take
> that risk. Why not tell them they can't use XHR at all, since there
> are old browsers out there that don't support XHR? Because
> developers should be given the opportunity to make this decision.
> Developers have chosen to use XHR even though there are browsers
> that don't support it, and this has led to progress. If they
> experience too much pipelining failure they can choose to opt out.
> I am very skeptical that at this point the failure rate is high
> enough that very many developers would opt out.
The risk that some browsers don't support XMLHttpRequest and the risk
that some configurations mishandle pipelining are not of the same
nature.
When content runs on one of the rare clients that does not support
XHR, it can detect that fact up-front on the client side, and provide
an error message or some form of graceful fallback. Even if it does
not check and attempts to use XHR blindly, the negative consequence
will be that the operation completely fails in an obvious way.
However, when content requests HTTP pipelining in a configuration
where it will fail, it cannot detect the failure up front or even
after the fact. The result of proceeding in the face of failure will
not be a clear failure but rather data corruption.
While less than 1% of users may experience these kinds of problems, an
API that may expose even a small proportion of users to unpredictable
data corruption is not a good idea. In particular, browsers that
implement the ability to request pipelining will expose users to these
problems, while older browsers will not, so the likelihood is high
that the browser will be blamed.
I would ask further why HTTP pipelining is especially more important
for XHR than for normal fetching of HTTP resources. Fundamentally, it
is nothing more than a performance optimization, and even the most
AJAXey of sites makes many more ordinary resource requests than XHR
requests.
> Ironically, this is probably our best opportunity to get through
> this impasse. If web developers can selectively turn on pipelining
> for XHR, the few remaining proxies out there that break under
> pipelining will start to be singled out and are more likely to be
> fixed, which in turn will create the pipelining reliability browsers
> need to use it more broadly.
I think there is a better way to break the impasse. (See end of this
message).
On Feb 18, 2008, at 9:20 PM, Mark Baker wrote:
> Hi Maciej,
> On 2/18/08, Maciej Stachowiak <[EMAIL PROTECTED]> wrote:
>> Last time I looked into this, there were some proxies and some
>> origin server configurations (in particular certain Apache modules,
>> perhaps now obsolete) that broke with pipelining.
> Can you define "broke"?
> I've done a search on Apache and Squid pipelining bugs, and didn't
> find any open ones.
I'll try to do more thorough research, but see above. The result was
undetectable data corruption in at least some cases.
>> Since it is not possible to find out from the server if pipelining
>> is correctly supported, and since it is not generally possible to
>> tell from the response that it has failed, enabling it by default
>> in the browser http stack was not a safe thing to do.
>> Since the breakage is caused in at least some cases by proxies, it
>> is not in general safe to let XHR users opt in since they may
>> control the origin server but generally would not control whatever
>> proxies are between the server and the user.
>> Pipelining is a great potential performance improvement and it's
>> sad that it can't safely be used on the public Internet right now,
>> so I hope we someday find a way out of the impasse.
> Well, I'd like to see some hard evidence of this before we write it
> off.
I think we need some research before making a decision either way. I
just wanted to make clear the nature of the risks, and in particular
that content author opt-in was not sufficient to fully mitigate them.
In any case, I think there is a safer way to enable opt-in to HTTP
pipelining, not just for XHR but for all content. I think the best
solution is to use a hop-by-hop HTTP header to signal server-side
support for pipelining. In particular, the "Connection" header is
hop-by-hop, allows an open-ended series of values, and is
semantically appropriate for this purpose. So I would propose that we
define an HTTP response header value of "Connection: pipeline" to
indicate that the server supports pipelining and encourages its use
for this particular connection.
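On the client side, the check could be as simple as scanning the Connection header's comma-separated token list. A sketch (Python, illustrative only; "pipeline" is the token proposed here, not a registered Connection option):

```python
def connection_tokens(headers: dict) -> set:
    """Return the lower-cased tokens of a Connection header value,
    which is a comma-separated list per HTTP/1.1."""
    value = headers.get("Connection", "")
    return {token.strip().lower() for token in value.split(",") if token.strip()}

def server_offers_pipelining(headers: dict) -> bool:
    # "pipeline" is the hypothetical token proposed in this message.
    return "pipeline" in connection_tokens(headers)
```

A client would send the first request on a connection unpipelined, and begin pipelining only once that response arrives carrying the token.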
I believe this solution is better than a script-side solution in two
ways:
1) The opt-in uses a hop-by-hop mechanism, so it will not cause
problems with proxies when the origin server can promise to correctly
implement pipelining but an intervening proxy cannot. Therefore it
will save us the effort of even investigating the potential problems
caused by troublesome proxies and weighing their cost.
2) It can work for ordinary resource loads as well as for XHR, so the
potential performance benefit is much greater.
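Point 1 rests on how hop-by-hop headers are handled: HTTP/1.1 (RFC 2616, section 13.5.1) requires a proxy to drop the Connection header and every field it names before forwarding a message. A sketch of that rule (Python, illustrative) shows why a pipelining offer cannot leak through a proxy that did not itself make the offer:

```python
# Standard hop-by-hop fields from RFC 2616 section 13.5.1.
HOP_BY_HOP = {"connection", "keep-alive", "proxy-authenticate",
              "proxy-authorization", "te", "trailers",
              "transfer-encoding", "upgrade"}

def strip_hop_by_hop(headers: dict) -> dict:
    """What a conforming proxy does before forwarding: drop the
    standard hop-by-hop fields plus anything the Connection header
    names for this hop."""
    named = {t.strip().lower()
             for t in headers.get("Connection", "").split(",") if t.strip()}
    return {k: v for k, v in headers.items()
            if k.lower() not in HOP_BY_HOP | named}

upstream = {"Connection": "pipeline", "Content-Type": "text/html"}
forwarded = strip_hop_by_hop(upstream)
# `forwarded` carries no Connection header: the origin server's
# pipelining offer stops at the proxy unless the proxy re-adds it.
```

Only a proxy that understands and supports the token would add it to its own downstream response, so each hop opts in for itself.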
Probably the appropriate forum to make this proposal would be the IETF
HTTP Working Group. I'll join the appropriate mailing list if others
are interested in pursuing it there. In advance of this, we could
agree by convention on an unofficial "Connection: x-pipeline" value to
see how well this proposal works in practice.
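For experimentation, an origin server could advertise the unofficial token with no special infrastructure. A minimal sketch using Python's standard http.server (hypothetical; just to show where the header would go):

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

class PipelineHintHandler(BaseHTTPRequestHandler):
    # Persistent connections are a prerequisite for pipelining.
    protocol_version = "HTTP/1.1"

    def do_GET(self):
        body = b"ok"
        self.send_response(200)
        # Advertise the experimental token proposed above;
        # "x-pipeline" is a convention, not a registered option.
        self.send_header("Connection", "x-pipeline")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

# To run: HTTPServer(("127.0.0.1", 8080), PipelineHintHandler).serve_forever()
```

A client that recognizes the token on the first response could then start pipelining subsequent requests on the same connection; everyone else would simply ignore it.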
Thoughts?
Regards,
Maciej