Speeding up a Rails request by 150ms by changing 1 line

ankhmoop · on May 8, 2009

I'm not sure why individuals feel the need to write an entire page of text regarding a relatively straight-forward bug fixes. "Improved [Rails|Ruby|...] performance by [X] ms by changing [N < 20] lines of code" posts are becoming a common occurrence.

The regular occurrence of such issues in a project implies an immature, poorly profiled code base, and are the kinds of things that other projects fix without fanfare.

Our most recent database-backed dynamic web application can serve >10000 requests/sec on our mid-range hardware at <2 ms/req, which we achieved by running the most basic of profiling early on and throughout the development process, and using software that does the same. I don't expect a gold star for this.

ZeroGravitas · on May 9, 2009

You've missed an important but subtle point. The article points out that the reason this was overlooked is not because Rails isn't profiled but because this behaviour is happening outside the standard rails profiling scope. That is a) why it's not a "relatively-straightforward bug fix", b) why it hadn't been caught already and c) why it is interesting. Actually I take that back, it is a relatively-straightforward bug fix. As with all the best bugs it's finding it that wasn't straightforward.

I think a similar thing happened about a year ago when people were profiling rails but not profiling memory usage. After they realised GC was impacting their performance they shifted their focus and improved things in that regard. The most recent "X line Y% speedup" article I read here took this even further and examined the impact of the size of stack vs. heap memory on GC times.

My particular favourite example of this is Steve Souders work showing how much your web app performance is affected by the end users browser and its wacky network and cache behaviour. He has claimed these factors can account for 80-90% of the end-users' waiting time. I know rails core have been working to fix these issues as they did a presentation about.

That sounds to me like a bunch of useful lessons to be learned from. Certainly more info than just "profiling good", more like non-holistic profiling can be misleading. Your response certainly doesn't cover this as you talk about "basic profiling", so what is outside the profiling you do? How would you know if it is affecting performance and to what degree?

ankhmoop · on May 9, 2009

You've missed an important but subtle point. The article points out that the reason this was overlooked is not because Rails isn't profiled but because this behaviour is happening outside the standard rails profiling scope. That is a) why it's not a "relatively-straightforward bug fix", b) why it hadn't been caught already and c) why it is interesting. Actually I take that back, it is a relatively-straightforward bug fix. As with all the best bugs it's finding it that wasn't straightforward.

Properly profiling the full execution path should be an entirely obvious step to take.

kscaldef · on May 8, 2009

The short and to-the-point version of the post went to the Rails-core mailing list. On my blog, I prefer to tell stories.

Bragging about how fast you can serve a request is really meaningless when there's no way to know what's involved in that request. On my personal side projects (also using Rails), I serve requests in a couple milliseconds too. But, there's a vast, vast difference in what those sites do, vs what's involved in running a large-scale e-commerce operation.

ankhmoop · on May 8, 2009

The short and to-the-point version of the post went to the Rails-core mailing list. On my blog, I prefer to tell stories.

There are quite a few stories of this nature, which could simply be summarized as individuals finding incredibly simple inefficiencies and then fixing them.

kscaldef · on May 8, 2009

IME, many many developers have no idea how to measure and profile their code. The reason for the stories is to help educate people with less experience in what tools to use, and how to use them.

Raphomet · on May 9, 2009

I'll admit that I don't know enough about this. Got any pointers or links for the intermediate Rails hacker? (I'll check out ab, as mentioned in your post.)

jfarmer · on May 9, 2009

Great! Then you should find them and write about them.

bobbyi · on May 9, 2009

I like this type of blog post. I'd much rather read stories about people solving real problems with real code than blowhards pontificating about their favorite design patterns.

Whether Rails is immature doesn't make it less interesting to see how people track down its problems and fix them.

lsb · on May 8, 2009

Having lived in Ruby code for a bit, I think it's fair to say that you're entirely right. It's immature, it's poorly profiled, it has memory leaks, and the scoping rules are a nightmare.

But it sounds like you were doing profiling and optimization early and often, which might have been premature, and thus might have been a waste of time.

Who's to say? Ruby isn't for everyone. Perhaps your ideal web stack is what you're using. (What're you using, btw?) I've been itching to use Haskell for a webapp.

carbon8 · on May 9, 2009

"Perhaps your ideal web stack is what you're using. (What're you using, btw?)"

He has stated in previous comments that he's using scala.

Interestingly, merb on jruby has reportedly reached 14000 rps on similar hardware to what he's refering to (8 cores, since he previously claimed to get 5000 rps on a 4 core machine).

lallysingh · on May 8, 2009

Between the lot of these, the only message I'm getting is "ruby and rails authors happily dump speed in favor of not thinking terribly hard."

Seriously, is it that hard to pull up dtrace and do a little analysis? They even have dtrace-ruby bindings on MacOS! http://www.infoq.com/news/2007/10/ruby-leopard

icey · on May 8, 2009

Rails returns the body 1 line at a time? That can't seriously be true, can it?

kscaldef · on May 8, 2009

True. Seriously. At least in 2.3. Pre-Rack, I don't imagine that code would have been present.

carbon8 · on May 8, 2009

It's in 2.3 stable, but it appears to no longer be present in edge (in fact, ActionPack has already gone through a lot of changes):

http://is.gd/xP8Y (github link)

mtarnovan · on May 8, 2009

String.each is the same as String.each_line in ruby 1.8. This code rewrite seems to be the reason that code breaks on ruby 1.9 if body is a string (this is guarded against in the body setter). (ruby 1.9 removes String.each)

http://github.com/josh/rack/tree/master :

* NOTE: String bodies break in 1.9, use an Array consisting of a single String instead.

p.s. this is definitely worth a 2.3.3 tag and release

kscaldef · on May 8, 2009

yeah, 3.0 changes a lot. It does appear that it will not exhibit the same problem.

icey · on May 8, 2009

Is there a lot of per-line processing in Rails? Or is this something caused by Rack?

I know next to nothing about Rails, so I'm sorry if these are questions with obvious answers. I just think the decision to process body responses on a per-line basis seems odd.

kscaldef · on May 8, 2009

It's definitely odd. I'm inclined to attribute it to carelessness, and just not really thinking about the implications of precisely how they chose to implement the Rack specification.

What's frustrating is that apparently there wasn't any performance testing of the 2.3 release that ought to have found this problem. It's been in the code base for about a year at this point.

dasil003 · on May 8, 2009

How often is ActionResponse.body a String? It's not really clear to me without digging deeper, but I could see this easily being an edge case related to certain caching and rendering methods. In any case, suggesting that any performance testing of Rails would have magically found this problem is unfair. Many many many Rails app instances are serving orders of magnitude more than 10reqs/sec, so clearly this is not a universal problem.

My guess as to the reasoning here is so that large files can be served without blocking while the whole thing is written to the socket. Granted, that's just my gut instinct looking at that code, maybe it is just careless...

kscaldef · on May 8, 2009

I checked that, and I couldn't find any case where the body _wasn't_ a String.

It's true, that the problem only becomes really glaring when the pages being served are quite large (this example was 1300 lines of HTML), but I still assert that good performance regression testing ought to have popped this up. With 100 lines of output, which is pretty reasonable, you'd still be looking at an extra 10ms on each request.

EvilTrout · on May 8, 2009

I've been running Rails 2.3 in production for a while with great performance.

In the article the author mentions it has to do with serving static files, which is something that nobody should be using Rails to do anyway. I'm not surprised nobody found it until now.

kscaldef · on May 8, 2009

No, we only incidentally found it while working on a piece of our app that serves static files. The issue looks universal to me, although with dynamic content or smaller output, the relative hit would be less noticable.

EvilTrout · on May 8, 2009

I stand corrected!

oldgregg · on May 8, 2009

Why the... ???

zcrar70 · on May 8, 2009

> We’re pretty obsessed with performance at Gilt Groupe

Not wishing to troll, but I'm not sure that Ruby/Rails is you're looking for if you're obsessed with performance...

callmeed · on May 8, 2009

Not to put words in anyone's mouth, but perhaps he meant "obsessed with rails performance ..."

jasonkester · on May 8, 2009

It makes me want to cry that people are still writing things in technologies that require you to drop little timing statements into your code to profile them.

Profilers have been around for what? 30 years? I think the first time I hooked Ants Profiler up to an ASP.NET site was 2002. Are these other web technologies really that far behind the state of the art?

kscaldef · on May 8, 2009

I didn't feel like getting into the nitty-gritty details of my analysis, but Ruby does have a profiler, and it integrates into Rails quite well, and I was using it together with timers in this investigation. However, the exact same issue applies to the profiler as to simple timers, which is that the 150ms in question happens _after_ Rails hands off the reply to the web server, so as long as you are only measuring within the Rails framework, you will not see it.

jasonkester · on May 8, 2009

Strange. My expectation would be that you'd hook a profiler up to the web process and it would simply follow all ruby code paths, regardless of whether Rails were involved or not. Or do we have different definitions of Profiler?

You're talking about a separate application that watches executing code and keeps statistics, right? As in, not simply a set of timing functions in a wrapper somewhere in the framework? If so, how does it come off the trolley after Rails finishes its job?

kscaldef · on May 8, 2009

ruby-prof is a module that runs inside the ruby process, and hooks into the interpreter to track what's happening. You have options for how to run it. You can run the entire program within the profiler, but if you do that, you won't get data out until the process ends (you kill the webserver, in this case). Or, you can do what's more typical for profiling Rails, which is to explicitly activate the profiler for the course of a single request, and report on that single request.

mtarnovan · on May 8, 2009

Take these kind of comments to Reddit, you obviously know nothing about Rails and Ruby, especially Rack and Metal. "Metal components operate outside the realm of the usual Rails timing and logging components, so you don’t get any internal measurements of page performance."

jasonkester · on May 8, 2009

Indeed, I'd never heard of Metal before reading this post. The quote you included is what lead me to wonder why there are not better tools for such a popular development framework as Ruby on Rails.

As that quote states, there doesn't appear to be a profiler as such, but rather a set of "timing and logging components" built into the framework. If it's true, then ruby today is essentially handling profiling the same way PHP did it in 1998. I find that surprising and saddening.

mtarnovan · on May 8, 2009

See above comment about dtrace probes, also there is ruby-perf and services like http://www.newrelic.com/features.html

Also, what is the way PHP handles profiling in 2009 ? I haven't developed in php in quite a while, and I admit I never used anything more complicated then the profiling tools built in Zend Studio.