Fastcat – A Faster `cat` Implementation Using Splice (matthias-endler.de)
96 points by jontro on Aug 1, 2018 | 25 comments



> "Nice, but why on earth would I want that?" I have no idea.

I know this is referring mostly to the `cat` portion and not the `splice` portion of the article, but I'll throw in a quick shoutout to `splice` for giving me one of the single biggest build performance wins in my time at Zynga (and possibly across most teams at the company at the time).

We had a ruby script which ran the majority of the build, and as the game grew we found that by far the slowest part was a loop which MD5 hashed each individual asset and used that as its filename on our CDN for per-asset-versioning.

At its worst it was taking nearly an hour and a half; the code was basically as inefficient as you could make it - multiple shell calls for each file rather than any sort of inlining of the hashing process.

I wrote a basic C program using splice and an MD5 library which took the whole process to under 10s. A bit overkill, perhaps, but the naive speedup I tried first still took over 1-2 minutes, and I figured 99.99% was worth the extra few hours to put it together knowing how many builds we ran each day.

Definitely gave me a healthy appreciation for the cost of transferring to user space that has stuck with me.


> In this case, if you notice that cat is the bottleneck try fcat (but first try to avoid cat altogether).

"Useless Use of Cat Award" [0] is the canonical text for avoiding unnecessary use of cat, for those who haven't come across it yet.

[0] http://porkmail.org/era/unix/award.html (2000)


Last time I posted a link to that, I received quite a few replies from people who find it more natural to use `cat file | …` even when unnecessary. So even though I agree with the intent of the page, I feel it's useless to try to evangelise every case. If cat is the bottleneck, though, fair game.


First, if cat is slower than redirection from a file (<file), then I'd say something is amiss. But more to the point, I think it's really a bug that tools like gzip, grep, awk etc. work on files at all. We do need a tool to feed files to pipes, and I think cat is a fine candidate for that, even when we only con-cat-enate one file (the identity cat, if you will).

Maybe there are cases where a long string of awk|something|other|sort|uniq is not the problem, but forking an extra process for cat is.

And maybe there's a mismatch between pipes, files and mmap today. Splice seems like a reasonable fix (if we splice all the things, awk, grep etc).

Finally, I just think:

  cat input.txt \
   | filter1 args \
   | filter2 args \
   | reduction \
   | output-formatter
reads better than having to tack a "< input.txt" on at the end, or special-casing the first filter to also open the file.


> I think it's really a bug that tools like gzip, grep, awk etc work on files at all.

I'd say that's somewhat of a harsh premise, especially since the in-place editing of files available in many GNU tools (e.g. awk, sed, sort) is really useful and is based on exactly that ability.

I do agree that cat often makes pipes easier to read, though. And yes, obsessing over that one additional process seems to be somewhat silly. Unless, of course, it introduces a real bottleneck and the whole thing is time sensitive.


You don't have to put the redirection at the end, you can write

    < input.txt filter1 args # ... rest of pipeline
(But I agree that the cat version is more readable.)


True. I might not mind as much if it were possible, in a sane way, to do: "< input | (...)"


>if cat is slower than redirection from a file - then I'd say something is amiss

With `cat`, an extra process must be created. But with shell redirection, no extra process is needed: the shell simply opens the file and wires it up as the command's stdin, so that is going to be faster.
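
For illustration, a minimal sketch of roughly what the shell does for "grep foo < input.txt" in the child it forks anyway: open the file, dup2 it onto stdin, then exec the command. No extra cat process and no extra pipe to copy through. (The command and filename here are just placeholders.)

  #include <fcntl.h>
  #include <stdio.h>
  #include <unistd.h>

  int main(void) {
  	/* What the forked child does before exec'ing the command: */
  	int fd = open("input.txt", O_RDONLY);	/* the redirection target */
  	if (fd < 0) {
  		perror("open");
  		return 1;
  	}
  	dup2(fd, STDIN_FILENO);	/* the file becomes the command's stdin */
  	close(fd);
  	execlp("grep", "grep", "foo", (char *)NULL);	/* replace this process with the command */
  	perror("execlp");	/* only reached if exec fails */
  	return 1;
  }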


One good reason (IMO) for doing "cat file |" is that it's easier to grab the command from your history and change it to something like "grep foo file |" than it would be if you had run "cmd < file".


Also, if you have a long pipeline:

  cat file | this | that | other | out
  vs
  this < file | that | other | out
In the cat example, it's easy to change the head of the pipeline by adding things before "this" or deleting "this", which is less easy in the non-cat example. (The use case I have in mind is experimental commands that take probably <10s to complete, where editing time is a significant fraction of the time you spend.)


  < file  this | that | other | out


My counter-argument would be that with a good shell line editor, like zsh in vi mode, command transformations are as cheap as modular grammar; however, I know there are limits to that argument (Java is only writeable in Java IDEs), so I'll grant you that :-)


Thanks, added a link.


I re-implemented it in C and for some reason O_APPEND is set on stdout by default.

But aside from that, it works just like the Rust version.

  #define _GNU_SOURCE
  #include <fcntl.h>
  #include <stdio.h>
  #include <stdlib.h>
  #include <string.h>
  #include <unistd.h>
  
  #define BUF_SIZE 16384
  
  static void unset_flag(int fd, int flag) {
  	int flags = fcntl(fd, F_GETFL, 0);
  	flags &= ~flag;	/* clear only the requested flag */
  	fcntl(fd, F_SETFL, flags);
  }
  
  int main(int argc, char** argv) {
  	int pipefd[2];
  	pipe(pipefd);
  
  	unset_flag(STDOUT_FILENO, O_APPEND);
  
  	for (int i = 1; i < argc; ++i) {
  		int fd = strcmp(argv[i], "-") ? open(argv[i], O_RDONLY) : STDIN_FILENO;
  		if (fd < 0) {
  			fprintf(stderr, "%s: No such file or directory\n", argv[i]);
  			exit(1);
  		}
  
  		/* Move each chunk file -> pipe, then pipe -> stdout, without
  		 * copying through user space; splice returns 0 at end of input
  		 * and -1 on error. */
  		while (splice(fd, NULL, pipefd[1], NULL, BUF_SIZE, 0) > 0)
  			splice(pipefd[0], NULL, STDOUT_FILENO, NULL, BUF_SIZE, 0);
  
  		close(fd);
  	}
  
  	return 0;
  }
WTFPL if anyone cares.


Turns out the buffer size is significant: https://imgur.com/a/f4LiHVI

With 32 KiB buffers I get double the throughput compared with 16 KiB; the peak appears to be at 64 KiB, and after that it levels off.
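
For what it's worth, the default pipe capacity on Linux is 16 pages (64 KiB with 4 KiB pages), which might be related to the plateau around 64k. A minimal sketch of raising it with F_SETPIPE_SZ (Linux 2.6.35+) before experimenting with larger splice chunks; the 256 KiB request is just an example value:

  #define _GNU_SOURCE
  #include <fcntl.h>
  #include <stdio.h>
  #include <unistd.h>

  int main(void) {
  	int pipefd[2];
  	if (pipe(pipefd) < 0) {
  		perror("pipe");
  		return 1;
  	}
  	/* Ask for a bigger pipe buffer; the kernel rounds the request up
  	 * and returns the actual capacity. */
  	int cap = fcntl(pipefd[1], F_SETPIPE_SZ, 256 * 1024);
  	if (cap < 0)
  		perror("F_SETPIPE_SZ");
  	else
  		printf("pipe capacity is now %d bytes\n", cap);
  	return 0;
  }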


Direct link to the image: https://i.imgur.com/5jOb1yo.png


Could that be the size of some internal CPU cache/register/internal bank of memory? (Not sure how to properly describe it.)


Newer kernels also have the copy_file_range syscall (with a compatibility shim in glibc), which is supposed to use the most efficient copying approach available between two file descriptors (both referring to regular files). So it's more general than splice or sendfile.
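
A minimal sketch of such a copy, assuming glibc 2.27+ for the wrapper; the filenames and the 1 MiB chunk size are placeholders:

  #define _GNU_SOURCE
  #include <fcntl.h>
  #include <stdio.h>
  #include <unistd.h>

  int main(void) {
  	int in = open("src.bin", O_RDONLY);
  	int out = open("dst.bin", O_WRONLY | O_CREAT | O_TRUNC, 0644);
  	if (in < 0 || out < 0) {
  		perror("open");
  		return 1;
  	}
  	ssize_t n;
  	/* Passing NULL offsets lets the kernel advance the file offsets;
  	 * the call returns 0 once the source is exhausted. */
  	while ((n = copy_file_range(in, NULL, out, NULL, 1 << 20, 0)) > 0)
  		;
  	if (n < 0) {
  		perror("copy_file_range");
  		return 1;
  	}
  	close(in);
  	close(out);
  	return 0;
  }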


There is a Ruby gem for Linux called io_splice that does zero-copy IO. It hasn't been updated in a while, but it doesn't have any dependencies other than a modern Linux, and that doesn't mean it won't work. “Old” code that works still works; novelty, job-securitization and API churn be damned when they don't add value.

https://rubygems.org/gems/io_splice/versions/4.4.0

http://www.bigfastblog.com/zero-copy-transfer-data-faster-in...

EDIT: source to current stable coreutils’ cat http://git.savannah.gnu.org/gitweb/?p=coreutils.git;a=blob_p...


The most interesting thing about all this to me, other than the existence of splice (I really should finish The Linux Programming Interface), is that you need a pipe and two splice operations to get the data between other file types. There must be some dirty implementation detail forcing this, right? Right?!


splice is implemented in terms of the pipe buffer; that's why. I think it's a beautiful design because pipes have been around forever and they just work.
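
That also means a plain file-to-file copy needs the pipe in the middle: neither descriptor is a pipe, so each splice call supplies one. A minimal sketch, with placeholder filenames and only illustrative error handling:

  #define _GNU_SOURCE
  #include <fcntl.h>
  #include <stdio.h>
  #include <unistd.h>

  int main(void) {
  	int in = open("src.bin", O_RDONLY);
  	int out = open("dst.bin", O_WRONLY | O_CREAT | O_TRUNC, 0644);
  	int p[2];
  	if (in < 0 || out < 0 || pipe(p) < 0) {
  		perror("setup");
  		return 1;
  	}
  	ssize_t n;
  	/* file -> pipe ... */
  	while ((n = splice(in, NULL, p[1], NULL, 65536, 0)) > 0) {
  		/* ... then pipe -> file, draining exactly what arrived. */
  		while (n > 0) {
  			ssize_t m = splice(p[0], NULL, out, NULL, n, 0);
  			if (m <= 0) { perror("splice"); return 1; }
  			n -= m;
  		}
  	}
  	return n < 0 ? 1 : 0;
  }
The inner drain loop is there because splice may move fewer bytes than asked for on any given call.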


>Windows doesn't provide zero-copy file-to-file transfer (only file-to-socket transfer using the TransmitFile API).

Does anybody know if the Windows TransmitFile API can also be used to make file-to-file copies?


It takes a socket handle.


Great educational post, but really someone needs to make the long-awaited version called longcat!


Tl;dr: splice() is a Linux-only, zero-userspace-copy, file-descriptor-to-file-descriptor copy that has to use a pipe for one of the FDs.

Interesting, but less than earthshaking.



