> Time of "Getting that shit done" and cost of long-term maintenance are two of the main reasons why people use Ruby/Python, not for the raw performance.
Plenty of other languages "get shit done".
> And anyway, from my experience, Python has never been the cause of the slowness.
Then you're not actually performing performance analysis. Python is S-L-O-W.
I've written large systems in Python. I've profiled them to find the hotspots. I've micro-benchmarked them against implementations in other languages (C, Java). Python lost. Always. By a lot.
So then I had to rewrite critical sections of the Python in C (or accept that we'd need 32 servers instead of 2). Eventually I stopped bothering, and used a different language, because the aggregate cost of a thousand small performance issues (and a few big ones) is actually quite high.
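That micro-benchmarking exercise is easy to reproduce with nothing but the standard library. A minimal sketch (the function and data names are mine, invented for illustration): comparing a pure-Python accumulation loop against the equivalent C-implemented builtin shows the interpreter overhead directly.

```python
import timeit

def py_sum(values):
    """Pure-Python accumulation: every add goes through the interpreter loop."""
    total = 0
    for v in values:
        total += v
    return total

data = list(range(100_000))

# Both produce the same result...
assert py_sum(data) == sum(data)

# ...but the C-implemented builtin typically wins by a wide margin,
# because it skips per-iteration bytecode dispatch on boxed ints.
t_py = timeit.timeit(lambda: py_sum(data), number=50)
t_c = timeit.timeit(lambda: sum(data), number=50)
print(f"pure Python: {t_py:.3f}s   builtin sum: {t_c:.3f}s")
```

The exact ratio varies by machine and interpreter version, but the pure-Python loop loses consistently, which is the "always, by a lot" pattern described above.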
> Then you're not actually performing performance analysis. Python is S-L-O-W.
You totally miss the point. There are large classes of systems where CPU time in your app is not a significant factor. E.g. my go-to example of this is the first Ruby project I deployed in a production environment:
A queueing middleware server. If we served the queues entirely out of memory we could serve a few million messages a day out of 10% of a single 2005-era Xeon core (and while Ruby doesn't scale across cores, this system could easily spread load by simply running multiple queue processes). But of that 10%, 90% was spent in kernel space. So even if switching to C (or whatever) were to speed up the user-space part of the code by 90%, it would only cut CPU time spent from 10% to 9.1%.
This is fairly typical. If your code is CPU-bound, and it's not CPU bound because of stupid algorithm choices, then things can look very different. The first thing to do is find out if you're CPU or IO bound.
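Finding out whether you're CPU- or IO-bound can be done crudely from a process's own CPU counters. A rough sketch, assuming a POSIX-ish platform (`classify_workload` is a hypothetical helper, not a real profiling API): split wall-clock time into user time, system (kernel) time, and time spent waiting.

```python
import os
import time

def classify_workload(fn):
    """Rough CPU-vs-IO split for a callable.

    A sketch only: real analysis would use a profiler or OS tooling,
    but the idea matches the comment above -- compare user time,
    system (kernel) time, and wall-clock time.
    """
    t0 = os.times()
    wall0 = time.perf_counter()
    fn()
    wall = time.perf_counter() - wall0
    t1 = os.times()
    user = t1.user - t0.user
    system = t1.system - t0.system
    waiting = max(wall - user - system, 0.0)
    return {"user": user, "system": system, "waiting": waiting, "wall": wall}

# A CPU-bound body spends nearly all of its wall time on the CPU...
busy = classify_workload(lambda: sum(i * i for i in range(2_000_000)))
# ...while a sleep (a stand-in for blocking IO) spends almost none.
idle = classify_workload(lambda: time.sleep(0.2))
print(busy)
print(idle)
```

If "waiting" dominates, a faster language buys you little; if "system" dominates, as in the queue example above, the same conclusion holds.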
> You totally miss the point. There are large classes of systems where CPU time in your app is not a significant factor. E.g. my go-to example of this is the first Ruby project I deployed in a production environment:
This is only true if you ignore response latency as a comparative metric entirely.
> But of that 10%, 90% was spent in kernel space. So even if switching to C (or whatever) were to speed up the user-space part of the code by 90%, it would only cut CPU time spent from 10% to 9.1%.
I'm really curious what you were doing in Ruby that spent 90% of your runtime in the kernel. Having implemented similar middleware, I found that nearly all my runtime was spent just performing basic decoding/encoding of framed packet data, and as a kernel developer, I'm having a hard time imagining what an in-memory queue broker could have been doing to incur that overhead.
> This is only true if you ignore response latency as a comparative metric entirely.
Response latency makes scripting languages unacceptable for some types of problems. But in my experience very few latency problems I've come across are down to language choice rather than badly written database queries, lack of caching etc., and those would still be unacceptable regardless of language. Basically, the moment a database is involved, that's where you're most likely to see your low latency going out the window.
Of course there are situations where language choice definitively matters in terms of latency. Despite using mostly Ruby now, there are certainly places where I'd use C. I'd never try to replace Sphinx search with pure Ruby code, for example.
> I'm really curious what you were doing in Ruby that spent 90% of your runtime in the kernel. Having implemented similar middleware, I found that nearly all my runtime was spent just performing basic decoding/encoding of framed packet data, and as a kernel developer, I'm having a hard time imagining what an in-memory queue broker could have been doing to incur that overhead.
If your goal is a really efficient message queue, then the moment you find yourself "decoding/encoding framed packet data" you have lost, unless said decoding/encoding consists of nothing more than checking a length field and possibly verifying a magic boundary value.
If the userspace part of your queueing system does more than wait on sockets to be available and read/write large-ish blocks, you'll be slow.
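A framing that minimal, just a magic boundary byte plus a length field, can be sketched as follows (`MAGIC` and the header layout are invented for illustration, not any real queue protocol):

```python
import struct

MAGIC = 0x51  # hypothetical 1-byte frame marker
HEADER = struct.Struct("!BI")  # magic byte + 4-byte big-endian length

def encode_frame(payload: bytes) -> bytes:
    """Prepend the minimal header the comment describes: magic + length."""
    return HEADER.pack(MAGIC, len(payload)) + payload

def decode_frame(buf: bytes) -> bytes:
    """Check the magic value, read the length field, slice out the payload.

    Anything beyond this (per-field parsing, escaping, re-encoding) is the
    kind of userspace work the comment argues an efficient queue should avoid.
    """
    magic, length = HEADER.unpack_from(buf, 0)
    if magic != MAGIC:
        raise ValueError("bad frame boundary")
    return buf[HEADER.size:HEADER.size + length]

msg = b"hello queue"
assert decode_frame(encode_frame(msg)) == msg
```

The payload is never inspected, only sliced, so the userspace side stays close to "wait on sockets and move large-ish blocks".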
Of course there are plenty of situations where you don't get to choose the protocol, and you're stuck doing things the slow way, and sometimes that may make a language like Ruby impractical.
But there are also plenty of situations where in-memory queues are not acceptable, and the moment you hit disk, the entire performance profile shifts so dramatically that you suddenly often have much more flexibility.
> Response latency makes scripting languages unacceptable for some types of problems. But in my experience very few latency problems I've come across are down to language choice vs. badly written database queries, lack of caching etc.
5 ms you save in your runtime -- for free, without any work on your part -- is 5 ms longer you can now afford to spend waiting on the database.
> But there are also plenty of situations where in-memory queues are not acceptable, and the moment you hit disk, the entire performance profile shifts so dramatically that you suddenly often have much more flexibility.
I can only disagree here. I view CPU cycles and memory as a bank account. If you waste them on an inefficient runtime, you can't spend them elsewhere, and there are always places where they can be spent.
Excluding huge sections of the problem domain -- such as "decoding/encoding framed packet data" -- is a cop-out. There's no valid reason for it, unless you're going to go so far as to claim that Ruby/Python et al increase developer efficiency over other language's baselines so dramatically that it outweighs the gross inefficiency of the languages.
No. But it DOES matter to me that Python restricts you to using it for those cases, whereas I would want (and would technically be able) to use it for far more stuff.
No, I don't "prematurely optimize Python out of everything I write".
The exact opposite, I would like to be able to use Python for more stuff but I can't because it's slow.
Nothing about doing it "prematurely" here. I have tried it for specific problems and it was slow. If anything, having built it in Python first is the reverse of "prematurely". And I have, as many programmers have, a very good general idea of where it's slow and what it can be used adequately for without resorting to C.
And what I argued is that people constrain themselves unconsciously, based on that very knowledge, to the kind of projects they do with pure Python.
Nobody writes an AAA game in Python. Nobody tries to build a browser in Python. Nobody writes a performant Apache/Nginx-class web server in Python. Nobody writes a database engine in Python. Etc etc.
(Note for the "Vulcan"-style literal readers: by "nobody" here I mean "not many" / "not a sane person". I'm sure you can find some experimental, bug-ridden, half-developed example of all of those done in Python, with maybe 100 users at most. Oh, and I specifically wrote: "pure Python".)
> Nobody writes an AAA game in Python. Nobody tries to build a browser in Python. Nobody writes a performant Apache/Nginx-class web server in Python. Nobody writes a database engine in Python. Etc etc.
Almost nobody does any of those things regularly. I've written perhaps two ISAPI modules (in C++) and one Apache module (in C), and in none of those cases did the code I write replace the bottleneck. It was just that that was the "right" place to put the code. I've done performance analysis countless times (the first time was, indeed, to demonstrate that moving off VB3's p-code and onto Turbo Pascal's native code would have a negligible effect on the performance of a client-server application) and I've seen, over and over again, the bottleneck being the network and the database.
For the exceedingly rare case it's not, I agree, Python/Ruby/Perl/Smalltalk is not the best choice.
> On the contrary, multi language projects are a kludge born out of (pragmatic) necessity.
> Nothing inherently inevitable (or good) about them.
Um, except that coherent planes/tiers of abstraction are the foundation of robust software engineering. You write languages to a spec; compiler authors emit bytecodes for another spec; hardware engineers optimize transistors and lay traces on silicon to meet yet another spec. Etc.
Multi-language is the only thing that makes the world work today. The closest thing to a true single-language runtime is FORTH and the like.
Every other "modern" language, where you get to pretend your process has a contiguous address space, or that there even exists such a thing as a "process", is a pragmatic compromise between compiler and runtime technology.
> Um, except that coherent planes/tiers of abstraction are the foundation of robust software engineering. You write languages to a spec; compiler authors emit bytecodes for another spec; hardware engineers optimize transistors and lay traces on silicon to meet yet another spec. Etc.
Except that we're talking about stuff that should be at the same level of abstraction in the first place. It's not about writing a compiler or OS in Python, or using Python for CPU microcode.
It's about writing the numerical routines in a scientific/statistics package in the same language that you write the core business logic, etc.
> Nobody writes an AAA game in Python. Nobody tries to build a browser in Python. Nobody writes a performant Apache/Nginx-class web server in Python. Nobody writes a database engine in Python. Etc etc.
But most developers do none of those things in the first place, regardless of language.
"So then I had to rewrite critical sections of the Python in C"
I think that's kinda the point; rewriting critical sections in C is pretty much Working As Intended. That's why it's a scripting language. The point is that you didn't have to write the whole thing in C.
> I think that's kinda the point; rewriting critical sections in C is pretty much Working As Intended. That's why it's a scripting language.
No, it's pretty much not "Working As Intended".
It's just "Working as we have been used to, because, well, what can you do?".
As V8, IonMonkey, Dart, Smalltalk VMs, LuaJIT, even PyPy etc. have shown, nothing prevents Python from being an order of magnitude faster (and sometimes up to two), while remaining just as dynamic.
Rewriting parts of a Python program in C is not a technical necessity due to some (insurmountable) theoretical limit. It's a technical necessity because of a suboptimal interpreter implementation.
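The kind of code that argument applies to is the interpreter-bound hot loop. A sketch (the function is a standard Mandelbrot escape-time iteration, chosen only as an illustration): under CPython every float operation is dispatched dynamically, while a tracing JIT like PyPy's can compile the same fully dynamic source to machine code.

```python
import timeit

def mandelbrot_escape(cr, ci, limit=256):
    """Classic interpreter-bound hot loop: pure arithmetic on floats.

    Under CPython each iteration pays full dynamic-dispatch overhead;
    a JIT can eliminate most of it without changing the language's
    semantics -- which is the point made above.
    """
    zr = zi = 0.0
    for i in range(limit):
        zr, zi = zr * zr - zi * zi + cr, 2.0 * zr * zi + ci
        if zr * zr + zi * zi > 4.0:
            return i
    return limit

# The same source runs unchanged on CPython and PyPy; only the runtime differs.
elapsed = timeit.timeit(lambda: mandelbrot_escape(-0.75, 0.1), number=2000)
print(f"{elapsed:.3f}s for 2000 escape-time evaluations")
```

Timing this under CPython and then under PyPy, with no source changes, is a quick way to see the gap the comment describes; I won't quote a specific speedup since it varies by workload and version.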
A fairly moot point. Do you think most C developers can write safe C code? The 2 I've encountered in my short career basically have no idea what they're doing (which I suspect is partly why they stuck with C instead of exploring alternative languages).
To answer your original question - no, but there's less C code than the equivalent C-only program, so you're still better off. Or if your argument is that C is a security nightmare (which you won't find much argument from me on), replace C/CPython with Java/Jython.