High Performance Networking in Google Chrome

harshaw · on Jan 31, 2013

Awesome article. I feel like the most game changing part of chrome is that Google turned it into a web app, that is it is always up to date and you never have to install anything (compared to the mess that is IE 7-10). Everyone gets the next network improvements / security fixes. It is so simple and mind boggling at the same time. I know this isn't the main point of the article, but dealing with microsoft stuff recently really hammers this point home.

alyx · on Jan 31, 2013

IE 10 is auto updated as part of Windows Update, is that too messy too?

SigmundA · on Jan 31, 2013

Requires Win7 SP1 higher first.

Chrome only needs WinXP SP2, while still supporting Mac and Linux.

This is a big deal at least with my customers. It's just silly that Chrome has broader Windows support than IE. IE should not be so dependent on the base OS.

mbell · on Jan 31, 2013

That depends - Is IE10 one of the many components in Windows that obnoxiously requires the entire system to reboot after an update?

harshaw · on Jan 31, 2013

I am sure IE10 support will be better but MS is burdened by their decision to limit IE9+ to windows 7 and newer. I read through the MS article on their new automatic upgrade policy and it seems mostly focused on security (for good reason).

What is clear is that the chrome development organization is built to release all the time in perpetuity (just like web, mobile development). Not sure this is true for MS.

arocks · on Jan 31, 2013

Needing to reboot after installing an update or being nagged to death about it, that is definitely messy.

aviraldg · on Jan 31, 2013

Yes, it is.

IgorPartola · on Jan 31, 2013

Here is another optimization every browser could (and should) take advantage of, but currently does not: http://igorpartola.com/programming/how-browsers-should-conne...

Basically, if there are multiple IP's behind a hostname, connect a non-blocking socket to each, then do a select/poll/epoll/kqueue on all of them. Then, once at least one returns, immediately close the rest and use the newly established connection.

This has three nice side-effects. First, if one of the hosts is down, it will never be selected, unlike now when there is a 1 in N chance that you'll be stuck waiting for a very long time. Second, you don't need to explicitly check whether the IPv4 vs IPv6 stacks are operational. A connection that is returned to you is the one that works, regardless of the underlying protocol. Third, this provides crude load balancing. Presumably the host that connected the fastest is also the fastest to process your request. The my blog post for some numbers.

igrigorik · on Jan 31, 2013

Yup! In fact, I didn't cover it in the writeup, but Chrome and other browsers already do this kind of thing on multiple layers: IPv4 and IPv6 (see Happy Eyeballs), plus to recover from lost SYN's.

IgorPartola · on Jan 31, 2013

Interesting. One of the apps I run is split between two servers. When I was bringing everything up, if one of the severs was down there was always a 50% chance of about a 40 second blank screen before the other server was picked up. If the server that was chosen was in the DNS cache, there was a 100% chance of blank screen. Admittedly, this was about 2 years ago, so things might have changed, but the reliability properties didn't seem to be there at the time.

hobohacker · on Jan 31, 2013

We've considered this before, and the performance gains do not appear worthwhile (we've run much more extensive tests than you have on top websites to evaluate potential gains), it has extra complexity, it is more of a Hampering Eyeballs approach rather than a Happy Eyeballs approach, it has potential web compat issues (many sites will not expect you to connect to the IPs other than the first), etc etc.

IgorPartola · on Jan 31, 2013

Sorry, but I can't tell from your profile who "we" is. Are any of your results public? Also, what do you mean by "many websites will not expect you to connect to the IPs other than the first"? My OS's DNS resolver randomizes the A/AAAA records I get on every resolution. The website operator cannot control which IP I will connect to. If you mean that websites don't expect two simultaneous connections, then I would be curious how two different servers would coordinate the fact that each got a TCP connection from the same IP at any scale. How does that interact with NAT? For example I believe some IBM campuses are behind giant NAT's with a single IP address per building or some such. Which websites get confused by this?

hobohacker · on Jan 31, 2013

Sorry, I'm a Chromium dev, and in particular I'm the net maintainer who authored the majority of the current connection management code you are discussing. Our results are not public. The web compat concerns are with more exotic, enterprise configurations, not public ones. They are not the most compelling concerns though.

IgorPartola · on Feb 1, 2013

Cool. +1 for the clarification.

gcr · on Feb 2, 2013

I don't like this idea because now a single client will tax all of your servers at once.

ajtaylor · on Jan 30, 2013

Ilya has never disappointed me with any of his articles. They are always top notch, and this one is no exception. As a Chrome user, seeing how things work behind the scenes (and why) is fascinating!

ajtaylor · on Jan 31, 2013

After processing the full article, I'm completely amazed at all the extreme lengths Chrome goes employs to speed page loading. From the networking stack, with it's abundance of optimisations and resource reuse, to the massive (and slightly scary) intelligence of the predictions stack, it all combines to make one heck of a browser!

Even if Chrome had only 10% of its current features, I appreciate the fact that it provided competition into the browser market and spurred innovation by _everyone_. The benefit to the internet community as a whole is tremendous.

ericflo · on Jan 31, 2013

Interesting note that the network stack is implemented "as a mostly single-threaded (there are separate cache and proxy threads), cross-platform library."

I wonder if someone could extract that lib into its own project. It would be great to have a version of AFNetworking or Python's requests library built on top of this -- in order to take advantage of the advanced socket reuse and late binding stuff that he describes in the article.

But that's all without really looking at the code. Maybe there's just too much Chrome-specific stuff there.

igrigorik · on Jan 31, 2013

It already exists! And works very well! Check it out: http://code.google.com/p/chromiumembedded/

hobohacker · on Jan 31, 2013

This comes up periodically on chromium-dev. I've answered it here: https://groups.google.com/a/chromium.org/d/msg/chromium-dev/....

In short, we don't want to support other consumers, sorry. Feel free to do code drops, they should work fine.

robotmay · on Jan 30, 2013

Ilya always writes the best performance articles. Really nicely organised page too.

nzonbi · on Jan 31, 2013

Great article. Chrome is an amazing piece of software.

Talking about web browsers, I can't wait for the next big step. Mozilla and Google are already moving ahead: When web browsers become the whole OS. With web apps becoming fully capable, and with native apps increasingly network integrated, the distinction is becoming meaningless.

In that vision of web browsers as OSs, there is a big limitation. More than some still incomplete APIs, or a few ones still missing, the problem is the whole development story. Web development is still a chaos. The web platform, is a soup with far too many ingredients. Why we still have to develop application's UIs, using a document markup engine?.

The web is rich. There are all kinds of resources. It is non-sense to express all of these resources, with a single development paradigm: html-css-javascript. Native development, is more flexible. There is the possibility of choosing the right tool for the job. I hope that one day not so far away, browsers will gain similar capabilities. Support for multiple separated, specialized development environments.

chipsy · on Jan 31, 2013

I think the necessary reworking is going to happen, but it will take a few cycles through the "Wheel of Reincarnation" to make it fully realized - with the rendering example, we now have Canvas and WebGL; instead of relying on the browser, you can create something completely customized, with the downside being that many of the supporting features of the document-renderer paradigm are lost. But it's the right technology for certain kinds of applications, and others can be rebuilt on top of that, and eventually you'll get frameworks, and so on.

At each step, not everyone will be 100% happy with how things are organized, but enough of it will work that it expands the frontier of what kinds of software can be built on the web. We've been through this before - the web has worked because its paradigms keep getting stretched beyond expectations.

newman314 · on Jan 30, 2013

Great article but I wish some discussion would have been directed at the security implications of all this prefetching.

simonw · on Jan 31, 2013

Can you expand on this a little - do you know of any particular security concerns with prefetching?

newman314 · on Jan 31, 2013

IIRC, there are issues with leaking URLs just as with OCSP lookups. So it's more privacy related than security. I was in a hurry when I typed that earlier. But if you think about it, there's still a chance of sending company internal URLs to a third party which given enough data would start to build up a view of the intranet. We have no idea if google then feeds this into their crawler to get more of the hidden web and if there is a misconfiguration elsewhere would then cause results to be publicly visible. See posts about visible printers etc. in the last couple of days.

hobohacker · on Jan 31, 2013

Chromium builds predictive models which are stored client-side. They are not uploaded to a third party server, like Google.

easytiger · on Jan 31, 2013

how is it any different that fully, manually fetching everything? Surely the only difference in psychological as per

timc3 · on Jan 31, 2013

Wonderful wonderful article. Is there somewhere I could pre-order the book (which would be the first time that I have ever done so).

est · on Jan 31, 2013

Ot here, the network might be high performance but the socket API in Chrome need some fix.

https://code.google.com/p/chromium/issues/detail?id=170595

the chrome.socket API is really awesome, it can listen to tcp/udp connections, works like a mini NodeJS. It's a shame they didn't make it more stable.

kinlan · on Jan 31, 2013

I coded that demo, it is not the Chrome Socket API that is the problem, it is that I don't error properly in the code and it blocks the port like any socket server would.

est · on Jan 31, 2013

Thanks for the demo

How to catch that error? There's no exception callback in the doc

http://developer.chrome.com/apps/socket.html

And if the port is already blocked, how to recover?

jadc · on Jan 31, 2013

I am wondering about the kernel piece. Does Chrome on Windows have any code that is loaded in the kernel? I'd be quite surprised if they did. Any Chrome expert know?

ramidarigaz · on Jan 31, 2013

I don't believe he actually means the OS Kernel. It's a userspace "kernel" in that it coordinates the traffic and behavior for all of the subprocesses.

igrigorik · on Jan 31, 2013

Right. The "browser kernel" process, which is the coordinator for all the network activity, and a dozen other things.

ErikRogneby · on Jan 30, 2013

This pre-fetching behavior reminds me of Amazon Silk. However Silk leverages EC2, vs. doing all of the analytics locally.

steeve · on Jan 31, 2013

Ilya Grigorik delivers, again.

eliben · on Jan 30, 2013

AOSA are very nice books. It's exciting that another (POSA) is coming.

No1 · on Jan 30, 2013

They really need another acronym. I thought you meant another POSA[1] is coming.

[1] http://www.cs.wustl.edu/~schmidt/POSA/

eliben · on Jan 31, 2013

Good point. I hate it when folks make up names and acronyms without checking first for existing work, at least in related areas.

sparx · on Jan 31, 2013

awesome reading. other posts by Ilya Grigorik are great too.

boringguy · on Jan 31, 2013

Reading this @ FF :)