Finding and Understanding Bugs in C Compilers

WalterBright · on March 28, 2011

I typed in the examples from the paper and tried them out with the Digital Mars C compiler. They all passed, as well as http://blog.regehr.org/archives/482. I hope the authors will try to break dmc with their test suite generator, and look forward to fixing any issues they find.

yan · on March 28, 2011

I emailed John Regehr asking when they're planning to publish Csmith, and he responded "Almost certainly sometime in April"

bokchoi · on March 28, 2011

Cfront or Csmith? :)

yan · on March 28, 2011

Er, Csmith, sorry;fixed.

beza1e1 · on March 28, 2011

I checked with cparser, too. No issues here as well. Waiting for the release of csmith.

regehr · on March 29, 2011

In the meantime, here's one:

  regehr@home:~/volatile/bugs/tmp007$ cparser --version
  cparser (0.9.11) using libFirm (1.18)
  This is free software; see the source for copying conditions.  There is NO
  warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

  regehr@home:~/volatile/bugs/tmp007$ cparser -c small.c
  cparser: ast2firm.c:3197: conditional_to_firm: Assertion `get_irn_mode(false_val) == mode' failed.
  Aborted
  regehr@home:~/volatile/bugs/tmp007$ cat small.c
  unsigned char foo (int ui1, unsigned char ui2)
  {
    return ui2 ? : ui1 + ui2;
  }

beza1e1 · on March 29, 2011

Well, that isn't valid C99 or even C89. It's yet another GNU extension to support. Thanks for the report, though!

beza1e1 · on March 30, 2011

BTW the bug is fixed in the repository version.

russell_h · on March 28, 2011

The remarks about coverage are interesting, and lends support to something I was thinking about a while ago (maybe this is common knowledge):

If I'm writing test cases with the goal of increasing (line) coverage, I first look for large sections of code that aren't yet covered, then test the most obvious path to reach those lines (usually its some sort of special case or error handler). Sometimes this catches really simple mistakes, but generally those lines were written specifically to handle the the path I'm testing. The problem is that once I've tested that path I'm inclined to move on to un-covered lines, and don't test a lot of paths that exercise the same code, but that aren't necessarily obvious to me. Such paths, of course, tend to be more likely to contain bugs because they weren't as obvious to, or considered as critical by whoever originally wrote the code.

phiggy · on March 29, 2011

Some kind of mutation testing[1] might be helpful here.

1: https://secure.wikimedia.org/wikipedia/en/wiki/Mutation_test...

_3u10 · on March 28, 2011

Look into Microsoft's CHESS proejct which analyzes the code to quickly converge on error cases as well as a system by which thread interleaving is predictable and thus testable. It actually tests every possible combination of thread interleaving through a code path.

zwp · on March 29, 2011

Also the fantastic Microsoft Pex: http://research.microsoft.com/en-us/projects/pex/

_delirium · on March 28, 2011

As someone who works in generative algorithms for expressive/art/content purposes, the phrase "generating random programs that are expressive" is interestingly evocative. I wonder if there are any lessons from the past ~5 years of "fuzzing" algorithms for generative-art / generative-AI to mine.