
One thing many people are not aware of: statically linked programs run much faster than dynamically linked ones!

Or to be specific, they start up much faster: fork()/exec() is much slower for dynamically linked programs, while for statically linked programs it is much faster than most people think.
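If you want to see the difference for yourself, a rough C sketch like the one below (the binary path and iteration count are whatever you pass in) just times repeated fork()/exec()/wait() cycles; pointing it at a statically and a dynamically linked build of the same program gives a feel for the loader overhead at startup:

    /* Rough benchmark sketch, not from the original post: time N fork/exec/wait
       cycles of whatever binary you point it at. Comparing a statically and a
       dynamically linked build of the same program gives a feel for the
       dynamic-loader overhead at startup. */
    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>
    #include <unistd.h>
    #include <sys/types.h>
    #include <sys/wait.h>

    int main(int argc, char **argv)
    {
        if (argc < 2) {
            fprintf(stderr, "usage: %s /path/to/binary [iterations]\n", argv[0]);
            return 1;
        }
        int n = argc > 2 ? atoi(argv[2]) : 1000;

        struct timespec t0, t1;
        clock_gettime(CLOCK_MONOTONIC, &t0);
        for (int i = 0; i < n; i++) {
            pid_t pid = fork();
            if (pid == 0) {
                execl(argv[1], argv[1], (char *)NULL);
                _exit(127);                       /* exec failed */
            }
            waitpid(pid, NULL, 0);
        }
        clock_gettime(CLOCK_MONOTONIC, &t1);

        double secs = (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) / 1e9;
        printf("%d runs in %.3f s (%.1f us each)\n", n, secs, secs / n * 1e6);
        return 0;
    }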

There is a myth that forking is slow, and that caused people to abandon very simple, elegant and unixy solutions like CGI.

I wrote a whole web "framework" in rc shell (http://werc.cat-v.org) and the reality is that if you statically link your programs you can use shell scripts that do dozens of forks per request and still provide better performance than something like PHP with FastCGI.

(Another great thing is that shell scripts and pipes naturally and automagically take advantage of multi-core systems. Unix once again beautifully shows how simple and beautiful concepts like fork and pipes have unforeseen benefits many decades after they were invented.)
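To make the multi-core point concrete, here is a sketch in C of what the shell does for a two-stage pipeline (the commands, gzip and wc on a placeholder "bigfile", are arbitrary): each stage is a separate process connected by a pipe, so the kernel is free to run them on different cores.

    /* Sketch of what a shell does for "gzip -c bigfile | wc -c": two processes
       connected by a pipe. Each stage is an independent process, so the kernel
       can schedule them on separate cores. "bigfile" is a placeholder. */
    #include <stdio.h>
    #include <unistd.h>
    #include <sys/wait.h>

    int main(void)
    {
        int fd[2];
        if (pipe(fd) < 0) { perror("pipe"); return 1; }

        if (fork() == 0) {                  /* stage 1: writes into the pipe */
            dup2(fd[1], STDOUT_FILENO);
            close(fd[0]); close(fd[1]);
            execlp("gzip", "gzip", "-c", "bigfile", (char *)NULL);
            _exit(127);
        }
        if (fork() == 0) {                  /* stage 2: reads from the pipe */
            dup2(fd[0], STDIN_FILENO);
            close(fd[0]); close(fd[1]);
            execlp("wc", "wc", "-c", (char *)NULL);
            _exit(127);
        }
        close(fd[0]); close(fd[1]);
        while (wait(NULL) > 0)              /* reap both children */
            ;
        return 0;
    }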




On the contrary, fork() has scalability trouble as memory grows. Even with copy on write, copying page tables is still O(1) with respect to address space (granted, with a significant divisor). This overhead becomes apparent as programs grow to gigabyte size -- a fork which before took microseconds can begin to take milliseconds. Forking is slow in many situations.

The issue described above can be avoided by using posix_spawn(3), which on Linux uses vfork(2).
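A minimal sketch of using it (the spawned command, /bin/echo, is just an example):

    /* Spawn a child with posix_spawn(3) instead of fork()+exec(). The
       implementation avoids duplicating the parent's page tables, which is
       what makes plain fork() expensive for large processes. */
    #include <stdio.h>
    #include <string.h>
    #include <spawn.h>
    #include <sys/types.h>
    #include <sys/wait.h>

    extern char **environ;

    int main(void)
    {
        pid_t pid;
        char *child_argv[] = { "echo", "hello from child", NULL };
        int err = posix_spawn(&pid, "/bin/echo", NULL, NULL, child_argv, environ);
        if (err != 0) {
            fprintf(stderr, "posix_spawn: %s\n", strerror(err));
            return 1;
        }
        waitpid(pid, NULL, 0);
        return 0;
    }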


Do you mean O(n)?


Sorry, yes. That's what I get for posting before coffee :)

The cost of fork() is linear with respect to memory size.
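A toy experiment along these lines shows it (just a sketch; the number of megabytes to touch comes from the command line): touch some memory, then time a bare fork() whose child exits immediately, so most of what you measure is page-table duplication.

    /* Touch N MiB of memory, then time fork(). The child exits immediately,
       so the measured cost is dominated by duplicating the (now populated)
       page tables, which grows with the touched address space. */
    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>
    #include <time.h>
    #include <unistd.h>
    #include <sys/types.h>
    #include <sys/wait.h>

    int main(int argc, char **argv)
    {
        size_t mb = argc > 1 ? strtoul(argv[1], NULL, 10) : 1024;
        size_t len = mb << 20;
        char *p = malloc(len);
        if (!p) { perror("malloc"); return 1; }
        memset(p, 1, len);                 /* fault the pages in */

        struct timespec t0, t1;
        clock_gettime(CLOCK_MONOTONIC, &t0);
        pid_t pid = fork();
        if (pid == 0)
            _exit(0);
        clock_gettime(CLOCK_MONOTONIC, &t1);
        waitpid(pid, NULL, 0);

        double us = (t1.tv_sec - t0.tv_sec) * 1e6 + (t1.tv_nsec - t0.tv_nsec) / 1e3;
        printf("fork() with %zu MiB touched: %.0f us\n", mb, us);
        free(p);
        return 0;
    }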


To the contrary, using dynamic libs allows the OS to cache commonly-used libs. Almost every program uses libc. When your process runs the dynamic loader as part of loading the ELF binary, the OS has the option to map an already-cached copy of the library, and possibly not even allocate an extra page of memory for it.

Static linking has a number of pitfalls: wasteful duplication on your hard disk, wasteful copying of the whole ELF binary into memory when you could tap the OS library cache for your dependencies (including extra page allocations), and the inability to upgrade a dependency of a binary without recompiling the binary.


If you're fork/exec'ing a program you've run before (which you have, if you're in an environment where this would matter), the binary is already cached in memory by the filesystem cache anyway. But you don't have the processing overhead of doing the dynamic linking and possible relocation, nor do you pay the overhead of calling functions in a shared library. If the library is relocated, you don't even save memory.

For overly large programs, e.g. statically linking everything in an X11 environment, it might matter.


On the other hand, you'll never find that one of your programs has stopped working because Ulrich Drepper changed libc behavior again.


Ulrich Drepper no longer works on glibc; last I heard he was working for Goldman Sachs.


| Another great thing is that shell scripts and pipes naturally and automagically take advantage of multi-core systems

What about the sharing of state?


"What about the sharing of state?"

There isn't any shared state between components of a pipeline.


They don't naturally and automagically take advantage of that, no.


> Unix once again beautifully shows how simple and beautiful concepts like fork and pipes have unforeseen benefits many decades after they were invented

Multiprocessing, and the use of multiple processes (instead of threads) to take advantage of multiple processors, predates UNIX by a lot.


That's not the point. Unix invented the idea of a simple system call to create a process by forking, and (more importantly) the idea of a "pipe" syntax in the shell to connect data streams between processes in a natural and intuitive way. These were usability and elegance enhancements, not performance things.



