Moving the Linux Kernel to Modern C (lwn.net)
683 points by chmaynard on Feb 24, 2022 | 345 comments



From [1]:

> You are not "introducing" a new macro for this, you are modifying the existing one such that all users of it now have the select_nospec() call in it.

> Is that intentional? This is going to hit a _lot_ of existing entries that probably do not need it at all.

> Why not just create list_for_each_entry_nospec()?

Let's ignore whether this patch is needed or works for now - I don't feel competent to comment on that. But this is such a bad suggestion. Instead of fixing the default to be safe, and possibly having a _unsafe_but_fast() variant for the places where it makes sense, they want to keep the broken version and require users to explicitly opt in to safety.

Same as the infamous PHP mysql_real_escape_string (don't make the mistake of using mysql_escape_string!) [2][3] or a whole host of C stdlib footguns like strcpy/strncpy [4].

The default, easy, obvious option should be safe. The unsafe but faster option should be hard to use by accident and obviously marked as such from the name.

[1] https://lwn.net/ml/linux-kernel/Yg6iCS0XZB6EtMP7@kroah.com/ [2] https://www.php.net/manual/en/function.mysql-escape-string.p... [3] https://www.php.net/manual/en/function.mysql-real-escape-str... [4] https://en.cppreference.com/w/cpp/header/cstring


This is just how you work with large code bases. Updating 15,000 call sites is far from trivial. These changes happen in multiple phases, and in each phase, you still want to have a working kernel.

The unsafe version can be deprecated and eventually removed, but that is down the road.


You can do it without complex tooling:

1. Have 3 versions of the thing to change: select_nospec, select_nospec_safe, select_nospec_unsafe.

select_nospec_unsafe is identical to select_nospec. Do the mass replace. Everything stays the same.

2. Delete select_nospec

3. Start migrating to select_nospec_safe. You can still do this in small steps, because your old thing stays.

4. Where select_nospec_unsafe must remain, keep it and maybe mark it with a comment or similar so it isn't deleted later.

5. You finish and everything is changed.

Maybe rename select_nospec_safe back to select_nospec if you want to.
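
A toy, self-contained sketch of steps 1 and 2, using the illustrative names from this comment (select_nospec is a stand-in macro here, not a real kernel API):

    #include <stdio.h>

    /* the existing macro everyone currently calls */
    #define select_nospec(idx, sz)         ((idx) < (sz) ? (idx) : 0)

    /* step 1: introduce the new names; _unsafe is a pure alias, _safe is
       where the real fix (e.g. a speculation barrier) would eventually go */
    #define select_nospec_unsafe(idx, sz)  select_nospec(idx, sz)
    #define select_nospec_safe(idx, sz)    select_nospec(idx, sz)

    int main(void)
    {
        /* mass replace: select_nospec(...) -> select_nospec_unsafe(...),
           which cannot change behavior; step 2 then deletes select_nospec
           so no new users can appear */
        printf("%d\n", select_nospec_unsafe(3, 8));
        return 0;
    }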


When there are hundreds of branches, not just feature branches but real long-lived development branches, and pull requests are managed the way they are in the Linux kernel (chain of trust all the way up to Linus)... "Do the mass replace" becomes quite non-trivial, I imagine.


This strategy handles that fine as long as none of those branches lasts from before step 2 until the optional step 6. You pull from a long-lived branch, get a compile error because it calls select_nospec, do the replacement at that callsite, and commit. A headache but a manageable one. If you want to backport new features to old stable kernel versions, you can include a patch supplying select_nospec_unsafe as a synonym for select_nospec.


Depends on who you want to put the burden on?

It's relatively easy to do the mass replace in master. You leave the burden of the replacement in the other branches to people who own those branches.

Set up an automated script, to keep the 'unreplaced' version out of master (and people can also use that script to check their own branches).

Not sure if that counts as non-trivial, yet?

There are probably some other trade-offs involved. The people running kernel development ain't idiots.


> Not sure if that counts as non-trivial, yet?

Assuming perfect tooling and infrastructure, it doesn't sound like it necessarily has to be a big deal.

But how far into fantasy land are we? How much effort would be required to get there? Are this and all similar problems large enough to justify it?

Is adequate tooling and infrastructure even possible?

Does that make proposed changes, such as moving to C11, orders of magnitude more difficult?


I'm not arguing it's effort-free, but it is effort-minimal. The idea requires, at most, one or two major steps; the first and biggest is as simple as possible, can be done in one go, and has zero possibility of breaking anything.

The only major issue is that the change could be left half-way. But that is only possible with MAJOR indiscipline.

---

The major point, to me, is that this EXCESSIVE fear of breakage is not good. Yes, making upgrades is a pain, but you CAN make the pain tolerable with some planning.

Refactoring is like exercise for code: everyone dislikes the idea as excessive, but it is GOOD.

And let's be honest, among all the things that could require a refactoring, this case is one of the simplest of simple scenarios.


The branches won't compile until you fix every instance though, so it's not that risky. Just a bit messy.


And if the original version is not deleted in step 2 but only after the final step, then nothing breaks!


I’m not sure how it’s done with the Linux kernel (and git definitely makes it hard), but at Google (using the Perforce model) there are tools for mass renaming that try to guarantee that there are no regressions. Of course in Linux the amount of C macros makes it much harder to work on the syntax trees, but comparison after the preprocessor step should be possible.


The point is that the change is not a simple rename:

    struct foo *iterator;

    list_for_each_entry(iterator, &foo_list, list) {
     do_something_with(iterator);
    }
While the new version would be something like:

    list_for_each_entry_v2(iterator, &foo_list, list) {
     do_something_with(iterator);
    }
So the ’iterator’ variable may be removed, but as this is C, it might also be reused multiple times, or come from some struct member, union, or whatever. So I presume something like this would have to be fixed case by case.
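
For illustration, one plausible shape for such a macro once C99 declarations inside for are allowed (a rough sketch, not the actual kernel patch) is to pass the type in and let the macro declare the iterator itself, so nothing leaks into the enclosing scope:

    /* sketch only; the real kernel work uses different details */
    #define list_for_each_entry_v2(type, pos, head, member)            \
        for (type *pos = list_entry((head)->next, type, member);       \
             &pos->member != (head);                                   \
             pos = list_entry(pos->member.next, type, member))

    /* a call site would then not need a pre-declared iterator:
       list_for_each_entry_v2(struct foo, iterator, &foo_list, list) {
           do_something_with(iterator);
       }
    */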


You can do these kinds of changes with clang-tidy, if necessary. Clang-tidy is extensible and you can write your own rules for transforming source code.

Mind you, there are not a ton of people around who have the time to learn how to write custom clang-tidy rules. It might be done for 15,000 changes that potentially fix kernel vulnerabilities, though.


Getting code analysis/refactoring tools to understand huge, complicated projects like the Linux kernel is far from trivial.


In practice it is not that hard. The refactoring tool may only need to tackle one translation unit at a time. You create a pattern which matches some usage of a macro, and create some logic to write out some changes to the AST. It can involve a surprisingly short amount of code—because clang-tidy has good tools for writing patterns and modifying ASTs!

You can browse the clang-tidy source code yourself. There are existing checks specific to the Linux kernel in there already. Well, there’s one check. But if you use clang-tidy, you’ll discover that automatic refactoring of extremely large code bases is within reach.

https://github.com/llvm/llvm-project/tree/main/clang-tools-e...

The only question is whether you would want to spend the staff hours working on a clang-tidy check. Large code bases are exactly where the tradeoff makes sense.


Kernel developers have previously used a tool called coccinelle to do these kinds of mechanical mass changes. It's pretty nifty.


Unless something has changed in the last month, clang-tidy is not extensible. You can't write your own rules without forking the whole project.


I’ve worked at a couple companies that had custom clang-tidy checks. This is well-documented enough at this point, but it is still a bit arcane.

http://bbannier.github.io/blog/2015/05/02/Writing-a-basic-cl...

Yes, it involves forking. Forking isn’t that big a deal. We even had checks that were specific to internal libraries.


At the start, forking is no big deal at all. But how do you get new clang-tidy features from the main branch into your fork later? I think this is the only instance which I would really call technical debt.


“Git pull” works pretty well to bring new clang-tidy features downstream. If you write a custom check, it goes in its own set of files, so there will be no merge conflicts. The only thing that would break is when the internal clang-tidy API changes, but clang-tidy checks tend to be fairly short.

Really… I have worked at two companies that had extended clang-tidy to add custom checks for their code base. The whole point of clang-tidy is to automate code fixes in other parts of your code base. When done well, use of clang-tidy pays down technical debt rather than adding to it.


Forking to add new checks only causes problems with pulling from upstream when APIs you are using are changed upstream, which happens rarely. Forking to _modify_ existing checks would cause you a lot of pain, but the solution there is to copy the existing check instead.


You can also write your own standalone LibTooling [0] app, which accomplishes something similar [1].

[0] https://clang.llvm.org/docs/LibTooling.html

[1] https://clang.llvm.org/docs/LibASTMatchersTutorial.html


IIRC Google works the same way. The automated refactoring tools (Rosie?) may make it go faster, but you generally don't fix up 15,000 call sites in a single change... you break it up into smaller batches spread across the tree.


It's been a long time since I was there, but I thought there were plenty of 15,000 call site refactorings done in a single final CL. Not that the Linux kernel should do the same!


When I was there (several years ago) it was rare to do that across many projects. You don't want to have to roll back everyone due to a problem that affects one project. Also, there's a risk that changes that are happening in the meantime might mean the patch doesn't apply.


For a global change? These days that 'risk' is more of a certainty.


It may take a lot of time to get all the approvals from all the code owners. One of the big advantages of a tool like Rosie is precisely that it splits up the update into smaller CLs and has automation that nags all necessary reviewers and reruns the transformation once head moves.


What does git or Perforce have to do with refactoring? Why would whatever version control you're using matter?

> tools for mass renaming that try to guarantee that there are no regressions

beyond sed or whatever a fancy IDE already does? What kind of voodoo magic are we talking about here? Did Google solve the halting problem and I just haven't noticed yet...


Disclaimer: I haven't worked at Google, but you can easily read their papers like https://dl.acm.org/doi/pdf/10.1145/2854146 and understand what's going on. Refactoring tooling is integrated with the revision control system because in a giant codebase with thousands of contributors, you need to split a refactoring patch into smaller chunks and ideally use semantic rather than text-based diff tooling, or the changes will never get reviewed or merged before they conflict with more changes.

As for the actual renaming, yes, definitely beyond what sed does but probably around what the fanciest IDE does. Imagine a big global symbol dependency graph produced by the entire build toolchain all at once and cached, I think?

Also, this isn't the halting problem unless your codebase allows for fully dynamic invocation :)


Aren't all function pointers dynamic invocation? Or is "fully" one of those words that I'm not grasping in context?


> Aren't all function pointers dynamic invocation?

Given that compilers can be smart-enough to detect "non-dynamic" function-pointer invocation (e.g. when an execution-trace proves that a function-pointer parameter always points to the same function address), it's not safe to say that "all" function pointer invocations are dynamic.

Another case to consider is when one implements (Smalltalk-style) OOP with message-passing: in many cases it's possible to build that without needing to use function-pointers at all.


Also you have a type system. And even in C function pointers have types.

So you can exclude many things that would in-principle be possible.


> C function pointers have types.

I'm not 100% sure, but I remember that I could have both a two- and a three-argument function be called through the same function pointer. It's probably UB, but I think that as long as the dereferenced functions adhered to the standard argument behavior of the platform-specific calling convention it worked, and I don't think gcc, clang or msvc complained, but my memory might be wrong.
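
Something along these lines, for example - undefined behavior per the standard, but the explicit cast keeps compilers quiet, and on common calling conventions the extra argument is simply ignored:

    #include <stdio.h>

    static int add2(int a, int b) { return a + b; }

    int main(void)
    {
        /* the pointer type claims three arguments, the target takes two */
        int (*fp)(int, int, int) = (int (*)(int, int, int))add2;
        /* UB: the extra argument is typically just dropped by the ABI */
        printf("%d\n", fp(1, 2, 999));
        return 0;
    }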


The rename has to be atomic, which is one of the things your revision control system gives you, unless it's like CVS or something worse†. So, that's a necessary part of the solution space.

You want there to be two states: State A where the thing was called old_name and State B where the thing is called new_name. State AB where some code thinks it is called old_name but other code thinks it is called new_name is broken, and must not exist.

† This might seem outrageous to modern programmers, but CVS thinks in terms of files, so from its point of view it's fine if out of sixty files you tried to commit, 48 of them succeeded and 12 failed. Good luck fixing the resulting mess.


I remember having to go backwards from SVN to CVS for a project a long time ago, and I was so shaken by the fact that CVS couldn’t do atomic commits with multiple files!


Yeah, my favorite model of how git operates is "rewind all the way back to RCS, add atomic commits and take it from there".

SVN, baz/bzr and friends are what you get if you add networking before atomic commits. Git is what you get if you start with the proper data model and only then add networking.


I believe the commenter is referring to system where you take a giant “git sed” and automate a rather complicated process: 1. break up one huge diff into a set of many (hundreds or thousands) patches, where each individual patch only touches files in a particular subsystem. 2. send all those patches to the appropriate system owners and run the appropriate tests. 3. manage the actual merging of successful patches, as well as communicating to the original author any individual patches that might need more attention e.g. based on patch review feedback.

as you might imagine, it’s probably a pretty involved process to touch thousands of lines of code in a complicated system.


There are multiple branches being actively worked on. It would be very easy for someone to merge in an unrefactored branch and now half the code base is using the old call.


You'd want something like https://bors.tech/ to avoid that.

Bors works for git. Google has similar tooling for their system. (I think it's called the 'train'.)


Linux doesn't even have consistent testing let alone the kind of tooling or will that Google does.


The Linux kernel developers have pretty good tooling for refactoring. For example, they have Coccinelle and Sparse.

1. https://en.wikipedia.org/wiki/Coccinelle_(software) 2. https://en.wikipedia.org/wiki/Sparse


Linux definitely has testing. Some helpful person added our repo base to the kernel-test-robot list, and now I regularly get e-mails when something I'm working on in some random WIP branch broke the build on some other architecture (often in COMPILE_TEST mode), or some other random driver. Didn't even have to do anything myself. It's great for avoiding "this broke the build on $obscure_config" issues.


I don't see how Perforce makes this easier than Git?


I thought this was one of the main advantages of typed languages: refactoring or renaming is safe and easy. What gives?


While calling C a typed language is technically correct, I struggle to think of any typed language that's more weakly-typed than C. Arguably, Python has a stronger type system than C.


"static vs dynamic" and "strong vs weak" are orthogonal coordinates.


It seems GP was talking about the latter.


The C preprocessor is arguably a not type safe language unto itself.


The C preprocessor is a siren song: it lures people in with the promise of "extra performance/features/structures for zero runtime cost!" but murders any readability or rationality within your codebase.

Cool concept, but once you've dealt with any legacy codebase that has used it extensively, it feels like an anti-pattern/footgun. Ultimately you just want the language to do those things directly and for the compiler to be smart enough to optimize it later.

Essentially it is too clever for its own good.


No one likes the preprocessor, but it's practically required if you don't want to move to C++ with template metaprogramming.


I do like the preprocessor and my major pet peeve with C is that the preprocessor has been stagnant for ages. Like, why the hell can't I do something like

    #define BASE hey
    #define HEADERS foo bar baz
    #append HEADERS bad bal bah
    #push OSSUFFIX
    #ifdef WIN32
    #define OSSUFFIX win32
    #else
    #define OSSUFFIX unknown
    #endif
    #foreach H HEADERS
    #eval include "$(BASE)/$(OSSUFFIX)/$(H)"
    #endfor
    #pop OSSUFFIX
People go to great lengths making all sorts of weird structures from x-macros, repeated statements, defines that only exist to be used by other defines, etc., all to work around existing preprocessor limitations - and many of them would simply become unnecessary if the preprocessor could do things like variable editing, loops and being able to eval its own commands.
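
For context, a typical X-macro workaround looks something like this (hypothetical names): the list is written once and expanded several times with different definitions of X.

    #define COLOR_LIST \
        X(RED)         \
        X(GREEN)       \
        X(BLUE)

    /* first expansion: the enum itself */
    #define X(name) COLOR_##name,
    enum color { COLOR_LIST COLOR_COUNT };
    #undef X

    /* second expansion: matching name strings */
    #define X(name) #name,
    static const char *color_names[] = { COLOR_LIST };
    #undef X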

Even though some stuff can be done via language features, it is often necessary and more flexible to work with the source code itself.


One alternative is to just write a C code generator, which lets you mix C and some sensible language, eg. python. Then use that to generate the code which is then sent to gcc.


Well, C++ templates happen on the preprocessor too.


No they don't. Template instantiation happens during compilation, way later than the preprocessor.


Not really? They're part of compilation and obey all the type rules & syntax of the language. They're not textual replacements that run in a distinctly different phase like macros are.


where in hell did you learn that, this is entirely false


Well, it is today.

When I first learned C++, using cfront in the late 80's, lots of the language was implemented as C pre-processor macros.


maybe lots of the language, but not templates. Cfront 2 did not have templates, and Cfront 3 did have them without using the preprocessor.

You can even check out how it was done: https://github.com/seyko2/cfront-3/blob/master/src/template....


It's not too clever for its own good, it's programmers that are too clever for C, and desperately need basic features, and it's the only way to get them unless you switch language.

It's also surreally poorly designed, encouraging worse habits than C itself. Avoiding definition collisions and the lack of namespaces alone make it horrific for any moderately sized project and up. Combine that with the near complete lack of static analysis tools, and it's a recipe for disaster.


Yeah, I do agree. Interestingly, you could argue that it's fairly strongly typed; at least according to some definitions (and btw "type-strength" is not a rigidly defined term, so the definition itself is somewhat debatable).

I don't claim to be an expert in knowing what is/isn't in the various standards; I just look at the build errors and static analysis alarms. That said, I think the argument is that in the vast majority of cases it's not actually possible to change the type of a variable; off the top of my head, I can only think of pointer coercion. Otherwise, if you define an unsigned int, it stays an unsigned int.

Now strong/weak typing isn't necessarily the same thing as type safety. C has always seemed astonishingly bad on that front. It's like they try to trick novice developers into thinking they have a robust type system sometimes.

If you define functions implicitly they will link to any symbol, EVEN A VARIABLE... this is bananas. I think I sort of understand why compilers work this way, but it really feels like a bug in the language. Under some compilers you might not even get a warning for implicit function, either! TI had a compiler that hid them by default.

Enums are garbage in C. Again, you can misuse them, and may not even get a warning. You can pass an enum for color to a function that takes an enum for kittens and the compiler will be happy as a clam. The way they're often used can cause constant implicit conversions every time there's an assign/compare. May not be a problem for most positive values, but it's annoying if you're trying to develop standard compliant code. MISRA defines an "essential type system" and normal enums usage violates it.
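
A small illustration (some compilers can warn about this, but only with extra warning flags; no diagnostic is required):

    enum color  { RED, GREEN, BLUE };
    enum kitten { TABBY, CALICO };

    static void paint(enum color c) { (void)c; }

    int main(void)
    {
        paint(CALICO);  /* wrong enum, accepted without complaint */
        paint(7);       /* a plain int converts implicitly as well */
        return 0;
    }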

I'm probably forgetting a TON of deficiencies. Please add them or correct me where I'm wrong.


"arguably" is not needed. Preprocessor destroys most of the semantics and even some syntax of C.


That’s a good question, it’s not fair that you’re being downvoted.

It’s the C preprocessor that causes a mess. Tooling for C and C++, like automatic refactoring, lags behind tooling for languages like Java, C#, and Go. Refactoring tools have to deal with macros, conditionals inside #if/#else blocks, and header search paths.

In this case, the refactoring involves removing a variable from the enclosing scope of a macro invocation. The most likely way to automatically refactor it would be to write a custom check in clang-tidy.


Whether something is (statically) typed or not is more of a continuum than a binary.

Eg C tracks whether a variable is an int or a char. Haskell also tracks whether a function causes side effects or not. More sophisticated systems also track whether a function has to return eventually or could run forever.


You can still have plenty of issues crop up from code that's not currently in master yet, or forked projects that take patches sparingly, etc.

Typed languages let you know what's broken at compile time and aid in refactoring code you can see. They don't help you refactor code you can't see.


Atomically flipping a version is still difficult, as anything that needs to rely on the other version has to be fixed forward. If you instead make 5000 (or 500) small changes that affect 10-15 uses, you can solve the unique cases or roll-back and forward.


I wish languages would make it easier to decouple the "name" from the "code".

The unison language offers an interesting take on the problem of code evolution backward compatibility. https://www.unisonweb.org/


Lots of ways to do this: codemod, ratchets. It's a solved problem.


First, do not break.


Second, if it ain't broke, don't fix it.


Third, accept that when someone found your code is broken, they don't have to tell you.


> Same as the infamous PHP mysql_real_escape_string (don't make the mistake of using mysql_escape_string!) [2][3]

TBF that is really a straight bridge to the mysql C API, which is why it’s also in mysqli.

MySQL actually has a third one called mysql_real_escape_string_quote, because if the sql mode NO_BACKSLASH_ESCAPES is set the escaping function needs to know what context it’s used in, and thus what quote to double up, and mysql_real_escape_string will fail.


And of course, the real answer to whether you're supposed to use mysql_escape_string, mysql_real_escape_string or mysql_real_escape_string_quote is "why the fuck are you pasting queries together using string processing".


I get the point you’re trying to make, but the blame doesn’t really lie with PHP.

Like most things introduced in early PHP, the mysql extension was just wrapping C functions.

The concept of “real” escape comes from MySQL’s C API: https://dev.mysql.com/doc/c-api/5.6/en/mysql-real-escape-str...


That's the deal with backwards compatibility. Even third parties may depend on that name, it's not safe to go and change it.

On userspace that's the time when you deprecate an entire module and switch everything into a new name. The kernel does that once in a long while, but it's a bit harder for them.


The kernel doesn't care about third parties though, that's why the kernel API/ABI is not stable. This is one big advantage to doing it that way: sure, you need to have a consistent migration plan in flight to make sure you don't break mainline in the process, but you can immediately remove the old way without having to care about third parties, once you're done.


> Same as the infamous PHP mysql_real_escape_string (don't make the mistake of using mysql_escape_string!) [2][3] or whole host of C stdlib footguns like strcpy/strncpy

In all of those, the enhanced function takes more parameters. Switching the existing name to require a new parameter will break existing code, and people will not want to update. Making the new parameter optional is possible, but IMHO is messier than a new function that has required parameters and then deprecating the old function (and eventually removing it).


You are making the rather optimistic assumption that there is a single definition of the macro. There are likely dozens, most of which share a name and randomly overwrite each other; others share the name but are subtly different. Macros are a truly terrible way to metaprogram. In my experience, if you spot one in a code base and can remove it, you have removed a bug you never knew you had; if you spot one and change it, you have broken more things than you will ever know.


C99 has one nice feature compared to C89 that is little talked about: you can initialize local aggregates using expressions that can't be calculated at load-time.

   void fun(int x)
   {
     struct foo f = { x };     // not allowed in C89
     int bar[3] = { 0, x };    // ditto
   }
Chances are the kernel does this because, I think, it's also a GNU89 extension.


Linux already uses a lot of C99 features that were GNU extensions before C99, like designated initializers. So in practice, Linux is already C99, and hasn't been buildable on strict C89 compilers in a long time.


I'm trying to understand how this linked list works --

    struct list_head {
       struct list_head *next, *prev;
    };

    struct foo {
       int fooness;
       struct list_head list;
    };

    struct foo *iterator;

    list_for_each_entry(iterator, &foo_list, list) {
        do_something_with(iterator);
    }
If we are walking iterator->list->next ... how do we get the pointer of the next enclosing foo struct? Are they doing pointer arithmetic to get the beginning of the struct from the list field offset and casting it to foo?

Ah -- That does seem like what they do:

    #define list_for_each_entry(pos, head, member)              \
        for (pos = list_entry((head)->next, typeof(*pos), member);  \
            &pos->member != (head);    \
            pos = list_entry(pos->member.next, typeof(*pos), member))

    #define list_entry(ptr, type, member) \
         container_of(ptr, type, member)

    #define container_of(ptr, type, member) ({          \
         const typeof( ((type *)0)->member ) *__mptr = (ptr);    \
         (type *)( (char *)__mptr - offsetof(type,member) );})


Yeah, standard hack for intrusive linked lists in C and similar languages.
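
If you want to play with it outside the kernel, a minimal user-space version of the same trick looks like this (names mirror the snippet above):

    #include <stddef.h>
    #include <stdio.h>

    struct list_head { struct list_head *next, *prev; };

    struct foo {
        int fooness;
        struct list_head list;
    };

    #define container_of(ptr, type, member) \
        ((type *)((char *)(ptr) - offsetof(type, member)))

    int main(void)
    {
        struct foo f = { .fooness = 42 };
        struct list_head *node = &f.list;   /* as if pulled off a list */
        struct foo *owner = container_of(node, struct foo, list);
        printf("%d\n", owner->fooness);     /* prints 42 */
        return 0;
    }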


How do I learn all these pointer hacks in C?



I'm one of those people who think OS kernels should stay as portable and simple as possible (i.e. C89 or some other easily-bootstrappable language, to avoid Ken Thompson attacks), so this isn't great news to see "the ladder being pulled up another rung". Then again, Linux has already become immensely complex.


Linux has always relied heavily on GCC extensions, though -- it makes no attempt to actually be C89-compliant portable code. What it actually has is a minimum supported gcc version, which in turn governs whether particular features can be used. In this case the minimum gcc version has for other reasons finally got big enough that C99 and C11 support is definitely present -- the ladder was already this high.

(You can also build with clang, but only because clang deliberately aims to support most gcc extensions.)


The advantages of using a newer language greatly outweigh the disadvantages of theoretical "reflections on trusting trust" attacks. By mandating C89, you're condemning thousands of kernel developers to use a language that's over 30 years old because of a theoretical attack that has never happened and seems practically implausible. Does anyone really think there's a backdoor in both GCC and Clang (remember, Linux can be compiled on either)?


I am ignorant of nearly all compiler-related issues, but there was an article on here not too long ago arguing that nearly all OS development required old C, because choices of the committee would break use cases they required under the guise of undefined behavior.

Are the benefits of "modern C" worth compile times 2-3x longer?


Times like these I wish I were allowed to say exactly how much money big companies save by using the newest versions of GCC and Clang. The economic value of modern compiler optimizations is staggering.


I hereby give you permission to say exactly how much money big companies save by using the newest versions of GCC and Clang.


Thanks, but it's my employer's permission I'm concerned with. :)


> wish I were allowed to say exactly how much money big companies save[...]

Are you able to give hints that would guide us in thought exercises?


Facebook has hinted that they employ smart C++ people because some core optimizations can save on the order of several hundred thousand dollars per year (possibly in the millions). Most of that comes from not having to buy as much power to cool the server rooms; some of it is buying fewer servers, as what they have can do more. Facebook doesn't actually say what they save, but they have said they measure it, and we know how much it costs to employ an engineer full time working only on optimization, and we know who some of those engineers are.

You have to be very large to notice it, but the likes of Facebook, Amazon, and Google have massive warehouses almost entirely filled with computers. It doesn't take much to see how their power bill can add up.


Years ago Alexandrescu said that Facebook estimated a .1% speedup to HHVM would save them $100k/year. It's presumably only gone up since then, but he's stopped giving an actual number in talks.


I would argue that he is bragging, and that the amount of code optimization is minimal. Large companies love to brag about how many servers they have, and optimizing compilation is just a buzzword. Go ahead, buy more servers. $100k is nothing. It's less than 1 engineer year vs 24,000 engineer years.

The Distribution collectors, they brag about speed, and optimization is very important to them. They want to be able to test changes very quickly.


The parent works at Facebook/Meta leading their Rust team, focusing on compiler and ecosystem improvements. So that in and of itself is a sort of hint into what that info could be like, even if it's not actual details of the order of magnitude or anything.


The Google Search optimization team measures changes in terms of millions of CPUs. So if upgrading your compiler saves even 0.1%, well, that's a lot of servers you don't have to buy next year.


Apple went out and bought a CRAY.

"Jun 30, 1987 — The Cray is part of a $20m installation used by Apple's Advanced Technology group and consists of four CPUs operating at 9.5nS per cycle,"

Now we have distcc, which can massively parallelize both compilation and optimization.

Google's team probably shows some slight improvement, just to justify their work, but in the long run, it's just as sloppy as industry-wide coding is.

Microsoft is the absolute worst, along with Apple.


One avenue is to think about it in terms of datacenter compute. If modern compiler optimizations improve performance by 10%, aggregate the perf improvements across the entire world's compute and you get a huge saving.


> but there was an article on here not too long ago that was arguing that nearly all os development required old C because choices of the committee would break use cases required under the guise of undefined behavior.

I would guess that you're referring to either "How ISO C became unusable for operating systems development" ([0]) or "How One Word Broke C" ([1]).

[0]: https://arxiv.org/abs/2201.07845 , most recent HN discussion at https://news.ycombinator.com/item?id=30022022

[1]: https://web.archive.org/web/20210307213745/https://news.quel... , HN discussion at https://news.ycombinator.com/item?id=22589657


Most of the really annoying UB is technically in C89 though, it’s just that compilers haven’t really treated it with such contempt for the first two decades or so. I can’t even recall any new kinds of (non-library) UB in C99 or C11 (though there have to be some).

So “old C” in such a case would need to mean “an old C implementation” (or possibly a new one, but simple or configured to behave like an old one), something like GCC 2.8 maybe, and nobody’s using that on desktop. So the language standard version should be mostly immaterial, and it’s not like the C89-to-C17 difference is anything like the yawning C++98-to-C++20 chasm. (This is a carefully phrased statement: C99 had complex numbers, which are annoying, and variable-length arrays, which are a significant change, but C11 demoted both to optional features.)
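
Signed integer overflow is the classic example: the wording has been there since C89, and it's the compilers' treatment of it that changed.

    /* Undefined behavior when x == INT_MAX; a modern optimizer may assume
       overflow never happens and reduce this to "return 0". */
    int will_overflow(int x)
    {
        return x + 1 < x;
    }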


> is the benefits of "modern C" worth compile times 2-3x times longer?

I mean, you get more security by default and security is pretty darn important for kernels...


I think you're mixing up two different things. Newer standard versus newer compiler. All versions of the C standard have lots of undefined behavior. Newer compilers contain optimizers that are just better at leveraging it.


Why would a different standard cause longer compile times?


If the spec requires that the compiler look for a certain error and produce a diagnostic, then the compiler has to spend time doing that. The more complicated the spec is, the more work the compiler has to do to find the errors.

For example, Rust is frequently said to have long compile times. There are a number of interesting reasons why that is often true, but if we ignore the pathological cases then what we find is that borrow checking takes a significant fraction of that compile time. This is a big trade-off between features and complexity that the Rust language made very deliberately: the advantages of the borrowing rules are what makes Rust such a great language. The CPU time spent checking that those rules have been followed is a small price to pay, but not a negligible one.


Is there a single such check which you cannot disable with compiler flags?


You cannot disable the borrow checker in Rust with a compiler flag. You might as well try to turn off the type checker, or ask it to ignore syntax errors.


Declaring a variable in the middle of a function takes basically no effort to support. Especially compared to all the ridiculous things the kernel gets up to.

The ladder's being moved a millimeter.


Ken Thompson's hack is purely theoretical, especially in the modern computing landscape where the diversification of software supply chains would make such a hack much 1. less effective and 2. easier to detect. If we're talking about kernel security, there are other lower hanging exploits, many of which are enabled by outdated language design decisions and unsafe memory models.

As compiler technologies evolve, it becomes not only a necessary evil but rather a better course of action to trust compilers rather than humans tiptoeing around security risks masquerading as language idiosyncrasies.


> Ken Thompson's hack is purely theoretical

Erm. I was always under the impression that he had actually done it, and that his presentation was a historical anecdote. Is that not the case?


Yes, he did, but that just shows that it's possible to inject self-replicating code in a compiler. It doesn't actually tell us how resistant such code is to compiler source changes, audits, binary analysis, etc. which are all required for a successful exploit.


But the University of Minnesota experiments prove at least some malicious code is capable of passing at least standard Linux code review, right?


There's a world of difference between that and Thompson's attack. You don't exploit the kernel code: you exploit the compiler code, such that every program it compiles (a) is compromised, and (b) is incapable of detecting the exploit in programs compiled by the same compiler.


That's a completely separate issue. The trusting trust bug requires you to convince people to use your compiler binaries. The thing you're talking about just requires a patch to pass code review.

It's more work - you have to figure out a sneaky bug and write a legit looking patch - but anyone can do it. You don't have to be in a position of power already (e.g. being the Debian GCC packager) so overall it is much easier.


I think they only got past a single reviewer and counted that as success; none of their harmful code made it into the kernel proper. However, the University also committed thousands of automated "fixes" for linter warnings that weren't part of the study, and the kernel maintainers ended up reverting all of them due to the University's overblown claims.


No, no malicious code passed. All malicious code was rejected before making it into the kernel.


Parent didn't say anything about patch code review.


On a technical level, the hack has been demonstrated to work, but actually hacking someone via the technique has not, in my understanding.


It's obvious that the hack would work on a technical level, that's fundamental to the whole idea. The question is how complex the logistics of doing it in the real world would be; how you insert the tampered code, how you avoid anyone noticing, etc.

For example, I run Gentoo Linux, and haven't reinstalled my OS since 2004 or so. That means that, modulo a few binary packages, I have a direct source lineage to the state of Linux in 2004. If you want to pull off that attack against my system (and you didn't already back in 2004), you'd have to tamper with source archives. That would both imply changes that are easy to analyze (more than binary patches), and it would involve changing the archive hashes in the Portage tree. That tree is in Git, which means that it would create an immutable public record of what happened (Git is the original blockchain, remember), modulo forced pushes which people would, again, notice all over the place.

In practice, if you want to persistently backdoor a new system (supply chain attack), it's usually easier to do that in hardware or firmware than trying to do a RoTT attack on the distro and its compiler. In fact, it's users of binary distributions (or proprietary OSes) that should be more worried, as it is much easier to do a binary-based RoTT attack that self-updates to handle new versions consistently when all your users run the exact same binaries. Source code users should be more worried about compromise upstream than local persistence. And those attacks are a review / auditing issue, unrelated to RoTT.

In the end, if you are worried about being personally targeted, it's easy enough to make that impractical by re-bootstrapping your computing from an unpredictable source (e.g. walk into a random shop and buy a PC, walk into a net cafe and download your favorite distro and check the hashes there). And if you are worried about large-scale attacks, RoTT style ones aren't practical without someone somewhere noticing; you should be worried about traditional compromise instead.


Does https://en.m.wikipedia.org/wiki/XcodeGhost count? It's not targeting compiler developers, so it can't worm forever, but it is a malicious compiler that weaponizes its output.


As far as we know.

Thompson's hack relies on the Halting Problem, and the space for deviousness within the Halting Problem is infinitely large.


In this case practical attacks are very likely, due to insecurities added with C11 and the stone-age review model of Linux development.

C11 added insecure Unicode identifiers, and you won't catch them when reviewing patches manually via email. You need a special linter to detect such Unicode attacks. Or a proper development environment.

Compiler development usually evolves into more bugs, not less. Just now they caught up with the hundreds of bugs they added with gcc-9. Not talking about const, restrict, and strict aliasing, and the still not existing -Oboring for the kernel. Would you dare to use -O3 and -flto and -fstrict-aliasing in the kernel?


Linux has a checkpatch.pl. If it doesn't already test for Unicode identifiers, it would be trivial to add that. This is a non-issue. You don't ban a new version of a language just because it allows some undesirable things; it's trivial to ban those things specifically, either with scripting or specific compiler option overrides.

Heck, Linux already uses GCC plug-ins to implement some fancier security stuff; it is silly to think they can't handle forbidding Unicode identifiers.


Linux is not written in C89. It is written in gnu C89, which is a mixture of C89, C99, and a panoply of often poorly-defined gcc-specific features. The number of compilers that can successfully compile Linux is one; not even clang is able to fully do so yet, I believe.

Actually, as a compiler writer, I'd go a little bit further and point out that Linux itself isn't even written to the gnu C89 very well; it's often written to a "C is portable assembly" view of the language, which results in nastygrams and invective being hurled at compiler writers if they compile the C specification correctly and not according to the "proper" assembly the code author thought they were getting.

One of the benefits of more modern language revisions is that they actually tighten the wording on a lot of the more ambiguous parts of the specification--C11 in particular adds a much more comprehensive memory model that's very shrug in the older revisions of C.


Clang can do it. Google ships clang-built Linux kernels in Android and ChromeOS.


Also the Kernel used in the new Valve Steam deck.


Can clang compile vanilla kernel without Google's forks?


AFAIK it can. They actually spent quite a considerable amount of effort implementing all of the gcc extensions to do so. There is even documentation for that: https://www.kernel.org/doc/html/latest/kbuild/llvm.html


Apparently OpenMandriva also uses a clang-built kernel, presumably not a Google fork. I have no personal knowledge about OpenMandriva though.


Do you consider the ClangBuiltLinux project [0] a Google fork?

[0] https://clangbuiltlinux.github.io/


Clang 9.0+ & Linux 5.3+ work for x86_64; I believe it's been possible to compile arm64 for longer.


My understanding is that Linux has often faced issues where ”compiling the specification correctly” means “Ha-ha gotcha, we can actually throw half your code out of the window because you misread the deep aliasing rules on page 8432 of the spec”

Their hesitancy with newer standards is understandable, when viewed against that backdrop


One of these days, I will get around to writing my post as to why that take on undefined behavior is completely wrong.

More to the point, though, the only changes to undefined behavior in the C specification in newer versions (compared to C89) are either clarifying things that were already undefined behavior (e.g., INT_MIN % -1) or actually making some undefined behavior well-defined (e.g., allowing type punning via unions).
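
For example, reading a different union member than was last written - the kind of type punning mentioned above:

    #include <stdio.h>
    #include <stdint.h>

    int main(void)
    {
        union { float f; uint32_t u; } pun;
        pun.f = 1.0f;
        /* prints the IEEE-754 bit pattern of 1.0f, i.e. 0x3f800000 */
        printf("0x%08x\n", (unsigned)pun.u);
        return 0;
    }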


C23 will make calling realloc with a size of 0 into undefined behavior. [1]

[1] http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2464.pdf


That falls under things that were already undefined or impossible to use properly before.

Older standards said "it is implementation-defined whether the old object is deallocated". This didn't really work well:

- if realloc(..., 0) returns NULL if it freed the object, then you have confusion with error cases (sketched below). strtol already has this kind of interface and it's unusable

- if realloc(NULL, 0) returns NULL and does nothing it does something different than malloc(0). Some chose to make it do something different, some chose consistency with malloc.

- if you choose consistency with malloc then realloc(NULL, 0) will likely end up inconsistent with realloc(ptr, 0) where ptr is not NULL. On BSDs the two are consistent but also different from any other platform, so portable code could not rely on realloc(..., 0) doing something known: either you had possible double-free bugs on some platforms, or you had a memory leak.
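
To make the first case concrete, a short sketch of why the NULL return was unusable for portable code before C23:

    #include <stdlib.h>

    int main(void)
    {
        char *p = malloc(16);
        char *q;

        if (!p)
            return 1;

        q = realloc(p, 0);
        if (q == NULL) {
            /* Portable pre-C23 code cannot tell whether p was freed here
               (freeing it again risks a double free) or realloc failed and
               p is still live (not freeing it risks a leak). */
        }
        return 0;
    }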


I'm very unhappy that they chose to make it UB rather than mandating a behavior, but in portable C89 calling `realloc(ptr, 0)` is always a bug in your code, and it was an insane thing to declare implementation-defined originally.


IMO the C standards are conservative enough that even a standard level "only" a decade old is still fine.

With that said, we still have options even when moving to a newer standard. Many new language features can be machine-translated to C89 if needed (similar to how we have protoize / unprotoize, though not always as seamless). If we need to keep a bridge to C89 the kernel authors could hold to a subset of new language features that are amenable to machine translation.


You're confusing portability with age. Just because something is old does not make it more portable. It arguably makes it less portable as new platforms will not support old language versions.


> I'm one of those people who think OS kernels should stay as portable and simple as possible (i.e. C89 or some other easily-bootstrappable language, to avoid Ken Thompson attacks)

Why does the complexity of the OS, or even its ability to be compiled by multiple compilers, matter for "trusting trust" attacks?

As I've always seen it, the problems and solutions all exist at the compiler level. The whole premise relies on starting with a binary compiler that you are expected to trust. The source is also assumed to be safe and un-tampered for purposes of this discussion because that's an entirely different issue.

The solution, of course, is to build the compiler itself with different compilers. If you build the same compiler with two or more different compilers, then use that to compile itself, you should be able to with the right options (see the work that has been done on reproducible builds) get binaries that are equal or close enough to easily compare any differences.

At that point if they are the same then you know either it's good or both of the upstream compilers were also compromised.

If for whatever reason that's not practical at the top level compiler the same concepts apply going further back in history until you get to some early compiler a bored grad student wrote in the '80s in pure ASM.

As a result from a practical sense I don't really see "trusting trust" attacks to be that big of a concern. It's always possible to work your way back down the tree of software until you get to a point where you can actually trust a compiler and then build your way back forward from there.

If a compiler starts depending on its own tricks and gets to a point where it can only be successfully compiled by itself, then there are reasons to be suspicious. Even then you'd just have to have the last version to be able to be built by other packages as one more stop along the way to trust.


The article does mention at least one advantage, so it's not as if the ladder is being pulled up for no reason.


> to avoid Ken Thompson attacks

I was always under the impression that the "message" of Reflections on trusting trust was that you have to trust someone at some point.


Does trusted bootstrapping really become that much more difficult? If you're already doing a trusted bootstrap to an old version of GCC, you can use that to build a modern GCC.


> so this isn't great news to see "the ladder being pulled up another rung"

Man I so vibe with that. But the new Cs do have a ton of new features and, more to the point, I trust Linus to make this kind of decision more than just about anyone.


What's a Ken Thompson attack?


Ken Thompson's "cc hack" - Presented in the journal, Communication of the ACM, Vol. 27, No. 8, August 1984, in a paper entitled "Reflections on Trusting Trust", Ken Thompson, co-author of UNIX, recounted a story of how he created a version of the C compiler that, when presented with the source code for the "login" program, would automatically compile in a backdoor to allow him entry to the system.

https://www.win.tue.nl/~aeb/linux/hh/thompson/trust.html

The paper's a pretty entertaining read:

> First we compile the modified source with the normal C compiler to produce a bugged binary. We install this binary as the official C. We can now remove the bugs from the source of the compiler and the new binary will reinsert the bugs whenever it is compiled. Of course, the login command will remain bugged with no trace in source anywhere.


Most likely referring to "Reflections on Trusting Trust" [0]; i.e. when the compiler is itself the attack vector.

[0]: https://www.cs.cmu.edu/~rdriley/487/papers/Thompson_1984_Ref...


https://www.cs.cmu.edu/~rdriley/487/papers/Thompson_1984_Ref...

Ken Thompson, Reflections on Trusting Trust.

EDIT: I don't think I've seen 5 nearly simultaneous replies sharing the same link before.


>EDIT: I don't think I've seen 5 nearly simultaneous replies sharing the same link before.

LOL I was searching for a non-PDF link and delayed the reply. It would have been 6 simultaneous answers.



Solution: the compiler for the microlanguage at the bottom of the stack is none other than the kindly and incorruptible American film star, Tom Hanks.



Reflections on Trusting Trust - Ken Thompson

https://wiki.c2.com/?TheKenThompsonHack


I believe the person you are replying to is using that as an allusion to the issues discussed in the famous "Reflections on Trusting Trust" paper[1] by Ken Thompson.

[1]: https://www.cs.cmu.edu/~rdriley/487/papers/Thompson_1984_Ref...


Yes, but Linux won't even build with gcc 4, much less weird compilers like tcc or MSVC. That ship has unfortunately already sailed.


Sounds like GNU/Linux is probably warranted.


Kind of, but you can also use clang. On the other other hand, that's clang with GNU extensions, so /shrug


Though C89 does not support declaration of a scoped loop variable like C99 does:

    for(int i = 0; i < 10; i++) {
        // ...
    }
C89 does support declaration of variables at the top of braced blocks whose scope is limited to that block:

    {
        int i;
        for(i = 0; i < 10; i++) {
            // ...
        }
    }


That extra scope is an issue for what they're doing, however. They have a macro called "list_for_each_entry()" which syntactically behaves as if it were a "for(...)" (because it is a "for(...)"). To make the extra scope work with that macro, either the user of the macro would have to end the loop with an unbalanced amount of closing braces (to match an extra opening brace within the macro), or it would require a second macro to be used at the end of the scope. Keep in mind that this macro is used in literally thousands of files within the kernel.
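
A sketch of that second option, with hypothetical FOR_EACH macros rather than the real kernel ones, shows why it is unattractive:

    #include <stdio.h>

    /* the hidden opening brace lets the macro declare its own iterator... */
    #define FOR_EACH_BEGIN(i, n)  { int i; for (i = 0; i < (n); i++) {
    /* ...but a second macro now exists only to balance the braces */
    #define FOR_EACH_END          } }

    int main(void)
    {
        FOR_EACH_BEGIN(i, 3)
            printf("%d\n", i);
        FOR_EACH_END
        return 0;
    }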


Sure. This may not be a feasible approach for this case. I'm just pointing out alternative approaches to mitigate this kind of scoping problem in C89. Maybe wrap every use of list_for_each_entry in braces?

    {
        list_for_each_entry(...)
    }


Then you get dinged by static analysis tools like SonarQube because you introduced a useless scope or increased the "cognitive complexity" of the code, and have to explain it to a team leader who might not understand why you did that.


A team lead who does not understand that does not have the technical chops to be a team lead.


Doesn’t mean they aren’t still your team lead though!


Should we really be making decisions with people like that in mind?


If they are your team lead, would you have a choice?


I guess don't use a C99 linter on a C89 codebase?


One thing I've wondered for some time is why don't the Linux developers modify the compiler to suit their needs better.

At that project scale, wouldn't it start making sense to start solving problems like "If it were possible to write a list-traversal macro that could declare its own iterator [...]" by adding the functionality you want to GCC?


Current situation-- do nothing but yell at the compiler devs. Benefit: sometimes they listen. Cost: you cannot always (or maybe even often) get the compiler to behave as you think it should because you don't control it.

Deathtrap situation-- maintain an operating system and a fork of a compiler. Benefit: you can get more control over the compiler. Cost: you still cannot always get the compiler to behave as you think due to time constraints. Death cost: your first cost is multiplied by the fact that you're now maintaining a goddamned compiler.

I rankly speculate Linux is an extant project at its scale because it has refused to fight on two fronts like this. (And yelling across a border isn't the same as crossing it.)

Edit: clarifications


On the other front they decided to write their own VCS.


- and you will have a compiler which is out of date.


They do! Consider this patch which adds the GCC implementation necessary to make "static keys" work:

https://gcc.gnu.org/legacy-ml/gcc-patches/2009-07/msg01556.h...

Static keys are a kernel API that allows code modification at runtime to achieve zero cost feature flags, like tracing:

https://www.kernel.org/doc/html/latest/staging/static-keys.h...
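
Roughly how that looks in kernel code (a sketch from memory; the macro names come from <linux/jump_label.h>, and do_tracing is a hypothetical helper):

    #include <linux/jump_label.h>

    DEFINE_STATIC_KEY_FALSE(my_tracing_enabled);

    extern void do_tracing(void);   /* hypothetical */

    void hot_path(void)
    {
        /* compiles to a straight-line no-op that is patched at runtime
           when the key is flipped, instead of a load-and-test per call */
        if (static_branch_unlikely(&my_tracing_enabled))
            do_tracing();
    }

    void enable_tracing(void)
    {
        static_branch_enable(&my_tracing_enabled);
    }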


GCC already has that functionality, it's called C++. Like that macro is just a crappy version of std::for_each.

It doesn't seem useful to make a "C+" instead of switching to a language that just has the feature set the kernel needs. Like Rust, which is gaining some support within the Linux kernel already.


They've dug themselves into a hole when it comes to C++.

If you started writing a major project in C now you'd rightfully get sacked, but open source projects like this are often dominated by the opinions of those who only work on that project. That doesn't mean they're automatically wrong but just that they're often massively detached from any feedback apart from disaster.


C++ was discussed as having too many undesirable characteristics.

But that's exactly why I think a customized language for the kernel is an interesting idea. You could get pretty much exactly what you want for the kernel. Add features you need, remove any undesirable behavior or features.

For most programs that would be too much complexity, but the kernel has very particular needs. And I think a similar in spirit approach has worked very well with Qt.


> C++ was discussed as having too many undesirable characteristics.

C++ allowed the Serenity OS people to produce an entire OS with a GUI stack able to play Diablo and their own web browser in something like two years. It's depressing to think where we could be today in terms of OS if the Linux and GNU people weren't as insistent on their hate of C++


Note that Serenity OS also has its own stdlib "AK" that excludes a lot of criticized C++ "features". If the Linux devs were to carefully splice out the Good from the Bad and Ugly of C++, then maybe it'd suit their tastes just fine. I think the bigger factor here is that Linux became decidedly anti-C++ back when C++ was actually pretty crummy.


The Linux kernel has its own libc-ish too.


Why is C++ necessary? Shouldn't any programming language be able to do the same things, maybe with different amounts of code?


Sure, as my prof once noted, "Why, with the assembler you just need to type more!"


You must have had one of those progressive professors who did not consider assemblers "a waste of a valuable scientific computing instrument [...] to do clerical work": http://worrydream.com/dbx/


I love historical anecdotes like this. Goes to show that we should always be questioning our status quo and thinking bigger.


Please write a JavaScript program that will not deploy to your production system if you accidentally pass a string to any function that expects an integer.


So… typescript?


Because it's better.

Simple question, simple answer. Using C is purely an act of risk-aversion, contrarianism, and fashion.


You could build Serenity in C in a similar amount of time.


Perhaps someone could, but I certainly couldn't. :)


What are the arguments for this?


The desirable characteristics of C++ overweigh, by far, its "undesirable" ones (whatever those may be, and which by the way you can for the most part safely ignore, if you wish).


I don't know anything about C++ really, but I have been interested in learning it several times. The biggest deterrent for me has been the huge number of languages that you have to know to learn C++, it seems. Every new version comes with so many different features and seemingly (judging by example repos on git) different paradigms that it essentially feels like many different languages.

This is probably one of the biggest reasons I may turn to Rust over C++ - simply less feature creep due to less time being around.

Can you comment on how wrong I am about my feelings this way?


Yes, I understand, I would probably feel the same if I wanted to learn C++ all at once and for its own sake. Luckily, I have never had to do such a thing. Indeed, I may never learn C++ in its entirety, and I may even miss some "important" pieces, but I am OK with that. I am very comfortable using C++, and I still continue picking up pieces of wisdom here and there - as I go. My advice, learn by example, start with a small project and go from there.


Learning/teaching C++ is definitely a sore spot. Many still teach C++98, which is... bad. It'd be like still teaching Java 1.3: yeah, the syntax is the same basic shape, but pretty much everything else is different.

C++11 and newer are all largely one "category" in terms of recommendations. The CppCoreGuidelines is a good place to cover all that, but it's not a from-scratch introduction by any means.


Not wrong, but it's a feature rather than a problem.

C++ (17+) template-based metaprogramming is far more powerful and generic than what you can do in e.g. Rust. Converting a Rust, Go, Python, Julia, etc. library into C++ is pretty straightforward: use appropriate overloads and a few template tricks and you can mostly copy the code directly. But copying between those languages, or from C++, is much harder.

The solution is to learn one approach suitable for the problem you have right now, then widen over time. The "many languages of C++", as you describe it, is less of a problem; the problem is that some approaches are deeply and fundamentally flawed but still in common use.


Let's say you wanted to do that but also stay in the C family. It's going to be a lot easier to take C++ and just ban all the stuff you don't like (such as via lint rules) than to make a new language. In the former case you continue to benefit from all the ecosystem compiler & tooling improvements (ide support, static analysis, etc...). In the latter case you're on your own. Which is no small burden here.

I'm not necessarily advocating that Linux should switch to C++, just that there's probably not a good reason to invest in a new C/C++ hybrid language at this point in time. Not when C++11 is honestly pretty good, but also Rust, Zig, or even D's betterC all already exist.


C++ has implicit allocation and Linus doesn't like that, for good reason. While it's amusing to imagine trying to "ban all the stuff you don't like" when that means core language features, it is not a realistic plan.

In Rust, allocation lives in a library, alloc, and so the Rust for Linux project did all the work to offer alloc (it's full of useful stuff and it isn't like the Linux kernel can't allocate memory) but without implicit allocation.

If you use Rust to write say a Linux command line program, you can write

  greeting += " and welcome traveller";
... and of course this is implicitly an allocation; it's not like the greeting variable just magically already has enough space to append a string. But in Rust for Linux you can't do that: the implicitly allocating += operator is not provided on this type in their alloc library. If you want to say "allocate more space for the greeting" you can do that, of course, just as you can today in C, but you must do so explicitly. So when you try to add 16 GB of extra string space because you're an idiot, the API you had to explicitly call gives you an error, and that's your problem.
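
A rough C analogue (append_greeting is a made-up helper; in C the growth is likewise an explicit, fallible call):

  #include <stdlib.h>
  #include <string.h>

  /* Grow a heap-allocated string; the failure case is the caller's problem. */
  char *append_greeting(char *greeting, const char *extra)
  {
          char *bigger = realloc(greeting, strlen(greeting) + strlen(extra) + 1);
          if (!bigger)
                  return NULL;   /* allocation failed; the original greeting is untouched */
          return strcat(bigger, extra);
  }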

However the most critical reason Linux doesn't have C++ is that C++ proponents didn't do the work. Now, that will probably be because "reform the entire language to suit Linus" wasn't a viable plan, but the fact is that Linus can't accept patches that nobody writes, so even if you're sure C++ would be viable without drastic changes, you didn't write the patchset that does it. Likewise if people don't send Linus patches to do C11 it probably won't happen.


> C++ has implicit allocation

Ah, but see, it doesn't. Linus was wrong about many things in his rant. This being one of them.

Now the standard library does indeed have things that do implicit allocations, such as std::string. But these, like with Rust, are distinct from the language & replaceable. Would it be effort to make a kernel-safe std:: replacement? Yes. Would it be a lot of work? Not really. And it's the kind of thing Linux has been doing for decades with C anyway. It's not like they use a standard libc implementation (much less a rich libc implementation like glibc), for example.

And it's something game devs have been doing with C++ for decades without any issues, too.

So for this one you don't even have to ban anything. Just don't pass a standard library implementation to the compiler. It doesn't come with one, after all. You have to add it. So you could just... Not do that.

As for nobody doing the work... that's true. But Linus also pretty much nuked the entire concept of using C++, regardless of the what or how. Time seems to have changed his mindset on some of those things, hence his reaction to Rust, which largely makes all this moot. But we shouldn't confuse short-term politics with technical issues, either.


> Linus was wrong about many things in his rant. This being one of them.

It's not what is being discussed here. C++ has an implementation-defined memory model. That just cannot work with Linux out of the box. You need an OS and compiler designed from the ground up to handle this.


C++ has a well defined memory model: https://en.cppreference.com/w/cpp/language/memory_model And it's not that different from the C memory model. How did Linux work with C?


> C++ has a well defined memory model

Only since C++11, and it's not a full blown memory model.


C++ has, to a close approximation, the same memory model as C, both concurrent and otherwise. In fact, C11 just adopted the C++11 concurrent memory model wholesale.


it's defined enough for C to have adopted it since C11 (https://people.mpi-sws.org/~viktor/papers/cpp2015-invited.pd...)


Arduino, ARM mbed, Symbian, macOS, Windows, BeOS, IBM i and z/OS accepted the work of C++ proponents.

There is nothing one can do to change political views.

And let's not pretend that Rust in the kernel won't suffer from creative uses of macros, and there is still a big laundry list of issues to fix before it actually makes it.

Or that for the time being, Rust compilers need to link to C++ code to actually work, at least until Cranelift is a match in code quality against LLVM or GCC.

So in the end Linus gets C++ into the kernel, even if indirectly.


> Rust compilers need to link to C++ code to actually work

GCC itself has been written in C++ since 2010.


Indeed, which kind of diminishes Linus's assertions against C++ even more.


It's notable that the main improvement of C11 over gnu89 C is essentially templates, or rather a worse version of them (_Generic selections) with better compile-time performance.

If they switched to C++17 and banned everything from throw and RTTI to the standard library, they would still get range-for, auto, templates, and inheritance that doesn't have to be emulated in C.
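
For reference, a minimal sketch of that C11 feature (my_cbrt is a made-up name, modeled on the standard's own example):

  #include <math.h>

  /* _Generic: dispatch on the static type of the argument */
  #define my_cbrt(x) _Generic((x), \
          long double: cbrtl,      \
          float:       cbrtf,      \
          default:     cbrt        \
  )(x)

  /* my_cbrt(2.0f) calls cbrtf; my_cbrt(2.0) calls cbrt */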


> C++ was discussed as having too many undesirable characteristics.

I tend to agree, as a C++ developer. There are many core issues in the language that haven't been resolved and that are unacceptable for kernel code.

My personal pet peeve: C++ is unable to reallocate a new[] region. This makes basically all structures (vector, hashmap, trees...) unusable for large data handling.


Of those structures only vector benefits from realloc anyway. And if you really want it, it's not terribly hard to write. You can static assert that the template type is trivially movable which would make it safe to realloc.

But this is assuming you have a malloc implementation that does something other than implement realloc as just malloc+memcpy+free. Which not many do, not unless the allocation is so large as to be in its own dedicated mmap or similar.

That aside, it sure would be great if you elaborated on these unresolved issues that make the language unsuitable for the kernel. Exceptions and RTTI are the only two I'm aware of, and both have had off switches for decades.


> And if you really want it, it's not terribly hard to write

Sure, but basically it means rewriting all structures that rely on a bucket of stuff.

By the way, maps often use a large bucket array, and rehashing in place can be preferable.

> Which not many do, not unless the allocation is so large as to be in its own dedicated mmap

Do you know a modern operating system that does not have a mremap equivalent?

On Linux you pretty much use it as soon as you reach large blocks.


> By the way maps often use a large bucket, and rehash in-place can be preferable.

std::unordered_map (what I'm guessing you meant by a hashmap) uses a linked list for the nodes. There's no movement in the first place to worry about being realloc'd.

> Do you know a modern operating system that does not have a mremap equivalent ?

You have to be very large before most mallocs will put you on a dedicated mmap that can even be mremap'd at all.

If you're working with stonking huge data inline in a std::vector... yeah, just make a container for that usage; not really an issue. There are tons of examples out there, typically to add SSO, but doing realloc would be the same basic thing.


Google (C++17 subset in Zircon), Apple (IO Kit drivers with Embedded C++), Microsoft (since Vista), ARM (mbed), and IBM (C++ as a PL/S replacement) think otherwise.


*for drivers primarily


That's basically the trial period. Nothing depends on drivers, so they are pretty safe to try new things with, and the hardware they are used for is limited, so you can know that the hardware a given driver targets is supported by the Rust compiler. If we go a few years and it's all good, and the Rust compiler supports everything Linux targets, I can imagine more core parts becoming Rust.


> Like that macro is just a crappy version of std::for_each.

It really, truly is not comparable. list_for_each_entry does a well known operation that's just moving some pointers around. Safe to do while holding a spinlock, or in interrupt context, etc.

std::for_each is a generic iterator implementation. What code runs in order to iterate? Will it transparently take locks, risking a deadlock? Will it need to schedule()? You don't need to ask those questions about the standard linked list macro, and that has a lot of value when every instruction the compiler generates might matter.
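
For reference, the macro in question is used roughly like this in kernel code (struct foo, foo_list, and do_something are placeholders); the C11 change discussed in the article would let the macro declare pos itself, limiting its scope to the loop:

  #include <linux/list.h>

  struct foo {
          int value;
          struct list_head list;    /* links this foo into foo_list */
  };

  LIST_HEAD(foo_list);

  void walk(void)
  {
          struct foo *pos;          /* pre-C11 style: declared outside the loop */

          list_for_each_entry(pos, &foo_list, list)
                  do_something(pos->value);
  }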


list_for_each_entry just does blind string substitution. std::for_each is basically the same thing, except with type safety and variable scope safety. It's still just substitution.

All your other stuff about for_each is just... Wrong? It's all well defined what it does. The substitutions that for_each makes and exactly what it calls are all defined.

You could give it a container that does something stupid, but you could also give list_for_each_entry the wrong field or accidentally use it when it's no longer valid (literally the bug being fixed)


Point taken on how for_each works for linked lists. I'm (obviously) not a C++ expert and it shows in that context.

I suppose the thrust of my opinion is that a lot of code depends on the fact that it iterates over a linked list, which is safe in many contexts. The idea that callers could substitute container types which break its assumptions seems unwelcome to me. Nobody provides the wrong fields to list_for_each_entry because they get very nasty compiler errors if they try. But anybody can plug a new container type into a for each statement and get new behavior that silently breaks the assumptions of the original code.

I suppose that could be protected by a better type system though. There's a lot to improve on C even for kernel dev, and "C is greatest" isn't a hill I'm willing to die on :)


> But anybody can plug a new container type into a for each statement and get new behavior that silently breaks the assumptions of the original code.

How would it be "silent"? You're changing the container at the point where you're making any assumptions about it. This would just fall under normal code review purview, just like code reviews are how Linux enforces that anyone even used this linked list macro in the first place. C certainly doesn't care if you use list_for_each_entry for your container iteration, after all.

But it'd be a lot easier to replace these linked lists with a vector that has the same observable contract if C++ were used; it would just be way faster to iterate over.


All for (auto x : xs) { L } does is roughly:

  { auto it = xs.begin(); for (; it != xs.end(); ++it) { auto x = *it; L } }

Not exactly hard to see what happens, in particular as the iterator is typically just a raw pointer.

Regardless of whether you use one or the other, you must answer the same questions: should I take locks, etc.

With the macro you also have the concern that if any of the headers you include happens to change the order of inclusion, the macro being called may change without it being noticeable. That is a harder problem.


Maybe that's helpful today, but by remaining (relatively) standards compliant the kernel codebase has a more long-term resiliency. The more tightly wedded to a single compiler implementation the greater the long term risk for the project.

If some shenanigans were to happen upstream in GCC it would not be the end of Linux.


But they're not standards compliant. They rely on many GCC extensions that make compiling on something else not possible.


Other compilers support GCC extensions.

> The Linux kernel has always traditionally been compiled with GNU toolchains such as GCC and binutils. Ongoing work has allowed for Clang and LLVM utilities to be used as viable substitutes. Distributions such as Android, ChromeOS, and OpenMandriva use Clang built kernels. LLVM is a collection of toolchain components implemented in terms of C++ objects. Clang is a front-end to LLVM that supports C and the GNU C extensions required by the kernel, and is pronounced “klang,” not “see-lang.”

https://www.kernel.org/doc/html/latest/kbuild/llvm.html


That would make the code (and the compiler) non-C89 compliant. At which point, why not move to a different standard rather than bork both the kernel and the compiler if it can meet the needs better?


The Linux kernel requires GNU extensions; it's already not ANSI C89.


Alright, making it more non-compliant, then. Still creates problems for everyone and also necessitates a larger jump in the GCC version to support (unless you're going to back port the new extensions to every version that the kernel currently works with). Currently you can compile the Linux kernel with GCC 5.1. If you changed the compiler to support this one feature (or any other) you'd either need to back port it several versions (potentially back to 5) or abandon all of them. Which is probably a no-go with a project that's overall as conservative as the Linux kernel project.


I'm not a compiler expert here but when clang was new and one of the BSD distros was making a lot of hay about moving to it, I remember one of the arguments in favor being that the GCC codebase is a mess and not easily extended.


Because currently GCC maintenance is almost "free" (compared to maintaining GCC itself, which is as big a project as Linux), and they just have to work with the powers that be over at GCC. That's much better than forking it and having yet one more HUGE project to maintain independently. The advantages of sticking to the current policy FAR outweigh the disadvantages.


They do; IIRC they have some GCC plugins to change various things.


Anyone who insists on C89 is just talking nonsense. There are zero downsides to C99 and if you are using C89 you should not be.


~7 years ago you needed C89 to port to MS's compiler. But they brought some of that up to speed.

I would not be surprised if there is some niche environment somewhere that needs C89. Not any mainstream desktop, server, or phone, though.


C99 added support for VLAs which are a mistake. Fortunately, Linux stopped using VLAs a while ago.
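
For anyone wondering why: the stack footprint of a VLA depends on a runtime value, which is a poor fit for the kernel's small, fixed stacks. A sketch (not kernel code):

  #include <stddef.h>

  void fill(size_t n)
  {
          int buf[n];               /* VLA: stack usage scales with n */
          for (size_t i = 0; i < n; i++)
                  buf[i] = 0;
          /* a large or attacker-influenced n can overflow the stack */
  }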


> if all goes well, the shift to C11 will happen in the next kernel release


I'll be pinning that for a couple releases unless there's a major security bug O_O


Just stick to a LTS branch; no reason to give up bugfixes.


In many ways you can't avoid more modern C, because more modern standards make things that were ambiguous more clear, and then compilers follow that even if you select an older standard.

I think C99 introduces some bad things that should be kept out of any code base, variable-length arrays being the biggest one. Generics are another. Being able to declare variables anywhere doesn't make the code better, but it makes styles diverge, and that's a problem.

If Linux wants to adopt C99, they should define in their style guide which parts of C99 should be adopted.


> Being able declare variables anywhere doesn't make the code better…

It does, though. The ability to declare variables "anywhere" allows a reduction in the portion of the code where a variable exists uninitialized, or initialized with a dummy value which should never be used, or holding an obsolete value which is not intended to be used. It also allows more variables to be declared `const` (which requires them to be initialized in the declaration, which must occur after the initial value is computed), which helps to detect accidental assignment and also makes the code easier to review.

It's worth noting that you can more-or-less declare variables "anywhere" in C89 too—you just have to wrap the variable's scope in a compound statement. From that point of view, C99 doesn't change where variables can be declared. It just lets you remove some syntactic clutter.
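
A small sketch of the difference (compute_len and use are hypothetical helpers):

  /* C89: narrowing the scope requires an extra compound statement */
  {
          int len;
          len = compute_len(s);
          use(len);
  }

  /* C99: declare at the point of first use, and it can now be const */
  const int len = compute_len(s);
  use(len);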


I for one have read a lot of code with C89 declare-first style that was subjectively a lot less clear. This applies in particular to assign-once variables that depend on other calculations.

Stuff like this: [0]. I mean, what's the point of declaring variables like cp or err at the beginning when they're first assigned only halfway down? What is the point of having path around for the whole function, when it is a super temporary assign-once variable with a scope of 3 lines? I'm having a really hard time coming up with a justification for this, other than "this is the way we've always done it and we like it" (you can find actual rants in defense of this style from about 10 years ago).

For-loop declarations are another instance of this that makes the code more fluent IMO. And they help reduce the scope of variables to a strictly smaller block. I would find it hard to argue that this does not remove some bugs or improve readability.

The only thing that declare-first has going for it is that the variables that are in use can be seen at once, a little bit like in a struct declaration. Looking at how optimizers butcher those variables depending on liveness makes this feature seem less valuable, though.

[0] https://github.com/torvalds/linux/blob/2729cfdcfa1cc49bef5a9...


That is something I fully agree with.


> Variable length arrays being the biggest one.

The kernel already used VLAs as an extension, and is currently in the process of removing them from all the code using them.


Someone has been making a big mistake since 2018 then:

https://www.phoronix.com/scan.php?page=news_item&px=Linux-Ki...


Correction: I was mixing up VLAs (which were used before then, and removed) and zero-length/one-element arrays (https://www.kernel.org/doc/html/v5.10/process/deprecated.htm...).


Couldn’t they solve that macro issue using C89 by just having the macro create a block around the loop, and then declare the variable at the top of the block?


No, because while the macro can add the opening bracket for the block, it cannot add the corresponding closing bracket.
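
A sketch of the problem with a hypothetical macro (in C99/C11 the iterator can simply be declared in the for statement itself):

  /* C99/C11: the scope of 'pos' is limited to the loop */
  #define my_for_each(pos, head) \
          for (struct node *pos = (head)->next; pos != (head); pos = pos->next)

  /* C89 attempt: the macro has to open a block to declare 'pos'... */
  #define my_for_each_c89(pos, head) \
          { struct node *pos; \
            for (pos = (head)->next; pos != (head); pos = pos->next)

  /*
   * ...but the caller's loop body comes after the macro:
   *
   *     my_for_each_c89(p, &list)
   *             do_something(p);
   *
   * so there is nowhere for the macro to emit the matching '}'.
   */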


Makes sense. Thanks.


I find it scary that most modern infrastructure eventually relies on C, and C from decades ago at that.


The amount of fear I see from people around C is so surprising to me. Not saying you fit into this, but so far my anecdata suggests that it's largely people who don't know (or know very little) C. Especially among CS grads whose main exposure to C was in the stack smashing exercise in a security class, all they know about C is the part that was intentionally made vulnerable so it would be easy to exploit. Unless you're being wildly negligent and reckless with your programming, C is really not that scary.


My fear comes from the fact that I was a security consultant, and reviewed a lot of C applications containing nasty bugs.


What percentage of non-C applications did you review? I suspect C has enough footguns to be an issue, but its popularity, especially in low-level software (kernels, codecs, firmware) ensures that it'll show up in security issues regardless of how bad the languages itself is.


I reviewed mostly C, then reviewed mostly Golang. But I also reviewed codebases in Erlang, Perl, Java, C++, Rust, Python, Javascript, etc.

Mind you I was mostly reviewing cryptographic-related applications, but most C applications contained bugs that had nothing to do with the logic (lots of memory corruption bugs) while most Golang applications contained logic bugs. Or at least I would find logic bugs because Golang was both rid of most memory corruption bugs, and also an extremely readable language (so easier for me to understand the code and find logic bugs). Although Golang still had nil dereference bugs (happens a lot when people used protobuf), because they don't have sum types. Today I think a great language would be a mix between a readable language like Go (with good defaults, toolings, stdlib) and a safe languages like Rust.


There is a significant gap between C applications and those written in languages with a GC, at least in my experience.


How much GC stuff have you seen in comparison to C?


There's a difference among GC languages, though. Statically typed languages like Golang will always be more secure than dynamically typed languages like Python.


Interesting, I had the same job in the mid to late 00s, although I wasn't a consultant, so my sample was the company's own codebases (of which there were a lot, because we built a lot of embedded systems on top of VxWorks that did a lot of network communication, sometimes in very niche protocols), not necessarily the codebases of companies that are worried enough to hire a consultant. That was right around the time when compilers and security tools were becoming available that could flag nearly every possible problem. At that point false positives became a big challenge.

What years were you a consultant reviewing C applications?


I'm guessing you were using tools like coverity? I actually never used such tools. I mostly did manual reviews and sometimes implemented fuzzers with AFL. But most of the code I looked at was crypto code. Did that at Matasano/NCC Group from 2015-2019


it's been 15 years so I don't remember the names of the tools, but coverity rings a bell. There was one that we used to make fun of a lot because it was written in Java, but it was by far the best at finding stuff. It would even show you the AST to help point out problems. I'm suddenly feeling really nostalgic about GUIs written in Swing and SWT :-D


Definitely. Though Linux code has probably been tested more thoroughly, and run through more static analysis, than any other C code base. That does help me sleep a little better at night.


And yet, the most significant part of the C code issues we find is memory corruption. Which is either significantly harder to cause or impossible-by-design in many alternatives. Unless you can realistically say "people working daily, for years, on huge C projects write those bugs, but they're reckless and I'm better than them" - yes, C should be scary to you these days.


https://stackoverflow.com/questions/50724726/why-didnt-gcc-o...

We can't even agree on safe string functions for C, half a century later. You shouldn't have security bugs baked into the standard library and you shouldn't have to do a mountain of research to know which functions are safe and in which cases.

However, for most things non-string, non-pointer, and non-array, I agree with you.
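
strncpy is the classic "safe only in some cases" example: when the source doesn't fit, it does not NUL-terminate the destination. A sketch:

  #include <stdio.h>
  #include <string.h>

  int main(void)
  {
          char dst[8];
          /* the source is longer than dst: exactly 8 bytes are copied
           * and no terminating '\0' is written */
          strncpy(dst, "a long source string", sizeof(dst));
          printf("%s\n", dst);   /* reads past the end of dst */
          return 0;
  }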


It's easy to say "You shouldn't have security bugs baked into the standard library" but it's a lot harder to say, "we're breaking decades worth of working code by removing some functions that have been part of the standard lib and were widely used."


We don't even have a string library in C. Strings are Unicode, not just zero-terminated buffers. You cannot find strings, nor compare them. In the kernel you have filesystems and login systems using unidentifiable names. Because the kernel has no identifier support.

And as for insecure standards, the committees would rather eliminate the safe functions than fix the spec bugs or add u8 support.


> We don't even have a string library in C.

What prevents us from having one?


How would you mess with parsers by inserting null bytes in your strings then? (Remember seeing that in a talk from moxie on x509 certificate parsing.)


I mean, even an off-by-one is a security issue.


Apparently Mozilla, Microsoft, Google, IBM and all companies slowly moving their system infrastructure to Rust don't know enough C.

>Unless you're being wildly negligent and reckless with your programming, C is really not that scary

https://blog.regehr.org/archives/970


> Apparently Mozilla, Microsoft, Google, IBM and all companies slowly moving their system infrastructure to Rust don't know enough C.

They are moving that direction because of the safety promise that Rust offers and it is a "hot" language with a bunch of momentum behind it.


I've been programming in C for 20 years, and I think it's scary. Not because I don't know it, but because I know exactly how many cases of seemingly normal code can hide UB. I know from experience that even the best programmers following best practices will make mistakes (or run into someone else's). C is an extraordinary amplifier of bug severity. I know how much diligence, effort, and tooling it takes to merely not screw things up in C.

I've seen time after time people saying "nah, C is fine, you just avoid this and that, use these tools, etc." and this turning out to be insufficient. I've heard many times "maybe you're just a bad programmer and can't handle C, but I'm a good programmer and have no problems" and their code not surviving 5 minutes of fuzzing. I've seen people conclude that multi-threading beyond simplest constructs is just infeasible to get right, and think that's an inherent property of threading, and not fragility of C.


Yeah, I find it astonishing to find C programmers that see no problem, even though I think it would be reasonable to say that they see no reason to change. It's mind boggling.

Have they not worked on large projects with other developers? Have they not seen the myriad of ways things can silently go wrong for seemingly no technical benefit? (although I know there's often less obvious reasons, for eg. UB, performance, platform specificity, etc.)

All that said, I do think the following might be reasonable: > "nah, C is fine, you just avoid this and that, use these tools, etc."

I guess you mean they write off the risks entirely? You should never be this "handwavy", and should always take the risk seriously, especially with a language like C. However, I think it's fair to say that C is a good choice, many of the risks can be mitigated, and it's not THAT big a deal. In which case, the above doesn't seem that absurd.

Following basic common sense and making an effort to identify and eliminate some of the sketchier situations, backed by some really good integration testing, can really help. I feel reasonably safe under those circumstances (I mean not really, but other languages can be "unsafe" too). A huge chunk of the really evil things I've seen have been the result of taking absurd risks, and/or disregarding the rules entirely. If you were paying attention and "trying" to write good C code, they would never happen; these aren't just individual developer things, but project wide. Eg. I had a compiler that didn't even warn for implicit functions... jerk move by TI, but that should be flagged and dealt with. Instead, the team just thought "great no compilation errors".

I will say there's a lot of developers that are just uninterested in any of this and will deliver some really, really sketchy C code. In their mind they're smart programmers and their code will just be right, and they don't seem to understand any of these issues. Just plow ahead and patch around the bugs, then move on to the next gig.


What’s scary is the huge number of bugs in the Linux kernel that wouldn’t exist if it were written in a safer language.

https://syzkaller.appspot.com/upstream


Skilled C programmers make these mistakes day in, day out; we write too much code to trust a language that just lets it happen. Especially one that encourages writing code multiple times rather than reusing it.


I don't know about that. I agree about it being a popular idea with people that don't understand C. Plenty do and still harbor those opinions. I've been mortified by what I've seen in C, and it seems really preventable.

Lots of developers out there are wildly negligent. C provides them with enough rope to hang themselves. To be honest, I'm a little surprised to find veteran C developers that AREN'T scared. I guess they just see every disaster as the fault of "negligence and recklessness". If you're not scared of your own code (mistakes), you should certainly be scared of other people's.



C is unsafe by default. You should have to do something explicit to use unsafe features.


You shouldn't - it isn't perfect, but it is well understood. So far we don't have anything else that works better, and all the edge cases and weirdness are understood by enough people that someone could build something like Linux on it.

It is unwise to build a bridge on something untested with unknown failure modes, and it is equally unwise to do a rewrite or create new core infrastructure in a language without knowing it as well as a civil engineer knows concrete.


This isn’t the Linux kernel but I’d say it’s fair to assume that the same likely applies to it:

> Most of our memory bugs occur in new or recently modified code, with about 50% being less than a year old.

> […] we’ve found that old code is not where we most urgently need improvement.

https://security.googleblog.com/2021/04/rust-in-android-plat...


That makes some intuitive sense, right? The fact that it got "old" in the first place indicates that it's not being touched a lot, and if it isn't being touched a lot that means that bugs haven't been found, meaning that the bugs that are in there are especially sneaky edge cases, or there simply aren't any large bugs to begin with.


This is kind of one of those issues that can really cut both ways. I don't think it's the best attitude to say, "it's been working for years, so it's fine"; there could be subtle bugs, and areas rarely exercised. Still it often holds true. We all remember times we've meddled with something and messed it up. It seems that some of the low level code has been really heavily used in many different ways, and seems to just work. Especially if it's not safety/security critical (and maybe even if it is), it could be a poor use of resources trying to fix something that isn't broken.


I had this argument with some of my management. There was a push to upgrade all of our libraries that were deemed "old" with no other criteria than "it's outdated." I can appreciate shiny new toys, but if you're not hitting bugs and things are stable, I'd rather put my effort into adding features to our codebase and not chasing down library bugs.


There’s more risk in not updating dependencies due to not patching bugs


I haven't really touched non-GC'd languages in quite a while, but I feel like modern C isn't that unsafe, at least from the bits I've played with; it can even have a garbage collector if you want it [1] (which I usually do).

It's worth giving it another try if you haven't in a while, if for no other reason than to understand what's going on behind the scenes of your abstractions in Java/C#/JavaScript/etc.

[1] https://en.wikipedia.org/wiki/Boehm_garbage_collector


What language is more unsafe than C? C++? ASM?


This is still valid modern C:

   long* p = malloc(sizeof(int));   /* allocates only sizeof(int) bytes for a long*: too small on typical 64-bit targets */


> isn't that unsafe

It is as unsafe as you let it be, consciously or by mistake.


Wouldn’t it be the normal course of events that infrastructure would be based on decades old established technology…?


I actually find it reassuring. C is the most mature and proven language out there. With proper standards in place (the best example probably being NASA's), pitfalls can be avoided. Another example that immediately comes to my mind is Redis, which is such a great and stable piece of software. Java is similarly mature, but not the right tool for infrastructure.


> C is the most mature and proven language out there

You mean, aside from Fortran, COBOL, and Lisp?


I hear you. And a huge codebase at that. Unfortunately, the choice is limited; D or Ada would probably be better alternatives today.


I always strongly disliked the kernel's approach to macros (especially how they try to masquerade as everything but macros with their lowercase names). More than once I wasted time debugging code because I didn't realize that a macro was involved (special mention for the cases where depending on the compilation flags a symbol can sometimes expand to a macro and sometimes something else).

I suppose in the case of list handling there's not really a good alternative though. If C had lambdas we could use inline functions instead but that's not a thing. Still think it ought to have been uppercase though.


>Raising the minimum GCC version to 8.x would likely be more of a jump than the user community would be willing to accept at this point.

If you are using a 0-day-old kernel, why would you still be using GCC 5.x? 5.1 is almost 7 years old now; 8.1 is almost 4. Can't people just use apt / yum / rpm / whatever to upgrade to the latest GCC? Is that really too much to ask of the community?


I get the impression that a big part of it is just a general choice to be very conservative about forcing newer minimum versions. So to raise the version bar you have to make the positive case for why it's worthwhile -- merely "gcc 5 is ancient" doesn't suffice. The only reason it moved up from 4.9 is actual data-loss-provoking codegen bugs in 4.9 (see discussion in this lkml thread: https://lore.kernel.org/lkml/CAHk-=wjqGRXUp6KOdx-eHYEotGvY=a... )...

I do think they could move up a bit further, but it's useful to be able to just build kernels with the distro compiler and repology thinks that for instance Debian stretch (still an LTS supported version) is only gcc 6.3.


Debian stretch uses the 4.19 LTS branch. A switch to a new GCC would happen in a new version.


The distro kernel will stay on the LTS branch, certainly -- but the upstream kernel folk like people to be able to compile upstream kernels and not have to stick with the distro ones. (I've done it myself on occasion and i'm not a kernel dev; if I'd had to also get hold of a new gcc it would have been an irritating extra step.)


The Linux community is way way way larger than just the sum of distro user bases.

Even if all distros have long ago moved to GCC 8+, you still have other chips and systems running, and who knows what buried somewhere depends on GCC 5.


You'll have to upgrade the compiler eventually though, right? I don't know what that process would look like, but generally the longer you wait the more pain you'll face later.


Nope, it’s open source and some companies will keep it around until they fold in however many years (100?)

If it works, why spend money ‘fixing’ it, after all?


Compiler tech and hardware will improve drastically in the next 100 years. Linux can either evolve with it, or be put in a museum while something else takes its place, likely losing a lot of the development that's been done in the past 30 years.

It would be like running OS/8 (for the PDP-8, popular in 1965) on a modern machine. No time sharing for processes, no network stack, only a TTY for I/O... what would you even do with it if you could compile it?


We’re clearly talking about very different things.

If someone made a product which uses it, and people keep buying it, most companies aren’t going to bother with any of those things if they don’t need it.

That is of course completely different from what Linux is doing (as in, outside of that context). It may be at version 5.10, but that doesn't mean someone won't be running version 1.0 somewhere. And that's fine.


You can compile Linux and a user space program with different versions of GCC.


As well as with Tiny C (fast).


For an unstable branch maybe, but if people need to actually use this for real work they will want it compiling on something as well understood and baked as possible - while not being completely obsolete.


People doing real work are using a kernel compiled with a more up to date GCC. From what I can tell online even RHEL is up to date enough by compiling its kernel with GCC 8.x.


RHEL7 is still supported [0] by Red Hat through 2026, and is on GCC 4.8.5 [1]

[0] https://access.redhat.com/product-life-cycles/?product=Red%2...

[1] https://distrowatch.com/table.php?distribution=redhat


Correct me if I'm wrong but doesn't RHEL stick to a single kernel version and then backport patches?


Officially yes, although we can argue about how much you can "backport" and still count. But even so - if new code requires a newer C version and thus newer compiler, it's harder to backport.


You'd be surprised how much downstream grief a bump like this can cause in an ecosystem as broad as the Linux kernel. Dealing with the results can be unexpectedly overwhelming.

It makes sense to play it safe and do this in smaller incremental jumps.


It's not just a question of what the new version fixes, there is also the question of what it breaks. Sometimes new versions trigger latent bugs no one knew about that luckily happened to work fine on the old tools. Finding and fixing those can be difficult and time-consuming.


This argument would have merit if everyone still used 5.x to compile Linux, but that's simply not the truth. Most people are using a Linux compiled by a somewhat up to date version of GCC.


People love making a fuss about this kind of thing.

It's totally meaningless and usually dominated by the fact they've been asked to migrate, but it's still true unfortunately.


Does C standard naming have a Y2K problem? :)


Given C's ubiquity and (likely) longevity it'll have a Y2k89 problem: standards released in 2089 and beyond will have to change their naming schema and/or avoid being standardized in particular years. ;)


> or avoid being standardized in particular years.

That's a very "legacy code" way of avoiding the problem, it's just a great option for a centenary frozen language.

"Oh, no, look, C98 is from 2198. Do not confuse it with C97 that was settled in 2097."


Not until 2089.


The C standards aren't published at a regular interval of 10 years. They are published as needed or as agreed upon. So it is possible there may be a C standard published in 2089, but highly unlikely.

It's similar to but not quite the same thing as the Y2K problem.


So it is like a hashtable collision


I wonder what the implications of this are for Bootstrappable Builds. I guess they will just need a longer GCC versions bridge before getting to the step of building Linux.

https://bootstrappable.org/


The reference to moving to C99 is kind of interesting; given the security work paid for by Google to remove all uses of VLAs, I thought they were already using it anyway.


VLAs exist as a gnu extension even for -std=c89, unless -pedantic is also specified.


Ah, forgot about it. Thanks.


Hmm, I submitted the same article 10 hours ago and it didn't get picked up; where is the algorithm?


This has happened to me a number of times! I think time of day has a heavier influence than we might think. Submitting is only part of the algorithm. Both vote count and velocity of votes are really important too.


Welcome to HN. This happens all the time. I added 'Linux' to the title, which attracted attention.


I hope this isn't too much of a derail. I'm not proficient in C, and am translating a C module to Rust. My biggest complaint is there's so much DRY! Functions, variables etc are all prefixed with the module names, presumably due to lack of namespaces. The `.c` vs `.h` dichotomy is also DRY. The subset language that uses `#` is also a bit of a mess.


You're using DRY oddly, but I presume you mean "there is so much duplication" (which is the opposite of DRY)?

For the .c/.h part, it's because of use (or intended use). Header files are meant to be specifications (though a lot of implementation ends up in them, annoyingly). It actually is an example of DRY thanks to C's need to forward declare things. You can't call a function the compiler doesn't know about, for instance, so you declare (not define) it in the header if it's defined in an external module. If you didn't have the headers, you'd need to copy/paste or handwrite the declarations everywhere.
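
A tiny example of that split (hypothetical module):

  /* widget.h -- the declaration every caller repeats via #include */
  struct widget;                           /* opaque to callers */
  int widget_count(const struct widget *w);

  /* widget.c -- the one definition */
  #include "widget.h"
  struct widget { int count; };
  int widget_count(const struct widget *w) { return w->count; }

  /* caller.c -- needs only the declaration; the linker supplies the body */
  #include "widget.h"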


You can call a function the compiler doesn't know about, but god help you if there happens to be an identical symbol in the table; it will link to literally anything, including an integer. This should pretty much always be turned off by setting implicit functions to be errors.

Also, this explains all the prefixing that OP was complaining about lol, or at least that's one reason. I saw a particularly bad bug, due to poor naming decisions. Less is not always more, even if it annoys you.


You're right - I was referring to DRY violations.

Why do you think there needs to be a declaration beyond the definition? The definition is enough information for the compiler, and having both is a DRY violation.


Header files are a historical legacy from when C was developed. Because machines were small, compilers/languages were designed to do single-pass compilation on modules, which were then linked.

Friends of mine who are actual CS majors say with C it's not fixable without changes to the language. With Java or C# the compiler can do a pass and extract the object definitions. But because of ambiguous syntax and majorly because of the preprocessor it's not possible with C as it stands.


You need the declarations because the compiler requires them; it's not something that I just thought up. If you don't, then you get two different declarations for bar in the following. The one you intended (char bar(int)) and the one you didn't (int bar()):

  int foo(int n) {
    char c = bar(n);
    ...
  }
  char bar(int n) { ... }
The definition is not enough information for the compiler in a case like this, which is also essentially the case for any externally defined function. The compiler doesn't know anything about those unless its declaration is provided, which is what's in the header files.


There's one definition of `bar` there: It's a function that accepts an `int` as a parameter, and outputs a `char`.


Did I say there were two definitions? I said there are two declarations. Try using a function with a return type other than int before it has been defined or declared in C and see what happens when you compile it.


The function is declared/defined/described/written once, and used once. There is no ambiguity.

I don't understand your point about using a function that hasn't been declared. Of course it won't work!

I understand your point about using header files as interfaces to third-party code, but there are ways of doing that that don't involve duplicating all functions, structs etc.


The function is used once, and because it occurs before the actual definition (and there was no forward declaration) an implicit declaration occurs (the erroneous int bar()). When the function definition is later reached by the compiler, its signature doesn't match the implicit declaration's signature, causing the problem. If the function signature matches the implicit declaration, then there is no problem:

  int main(int argc, char* argv[]) {
    printf("%d\n", foo(10));
    return 0;
  }
  int foo(int n) {
    return n - 1;
  }
Will work just fine, giving you only a warning but will print out "9" as expected.


> an implicit declaration occurs (the erroneous int bar()

I still don't understand; why does the implicit declaration assume int bar() is the signature when bar is being passed an int and returning a value to a char?


Because C doesn't do type inference.


You don't need type inference to handle this correctly, you just need to wait until you've parsed the rest of the file and seen the signatures of all functions.


Yikes! The implicit declaration, if used before (as in a lower line number than the function definition?) sounds like a trap, and at the root of this.


Yes.

But that's C. You can't fix it without fundamentally altering the language.


They're discussing the move to C99. Not sure that counts as modern C at this point.


They're discussing a move to C11, which counts in my book.


yeah fair enough should've read the rest of the article before my snide comment


I haven't touched C in any serious capacity in quite awhile; how often does C get new revisions, because isn't C11 still more than a decade old?

I understand why the linux kernel folks aren't moving to the bleeding edge (Linux is old, you have to do these ports incrementally), but I'm curious why the pace of language is so slow. I guess it's because C has more or less stabilized and thus further revisions are less necessary?


> how often does C get new revisions

1989, 1995, 1999, 2011, 2017.

The 1995 one was an "amendment", not a full revision (and is mostly additions to libc, which Linux doesn't get to use; the only language change is the addition of digraphs).

C17 was a "bugfix" revision; compilers will have applied these fixes to their C11 implementations, so in effect the only difference between C11 and C17 is the value of __STDC_VERSION__. So for most intents and purposes C11 is still the most recent revision.


Fair enough! As I said, I suppose C doesn't need to change that much; people use C because they know what they're getting, and for that to be the case, it needs to be stable.


Next are C23 (2023) and C26 (2026). C23 is already closed, so fixes will have to go to C26.


> I'm curious why the pace of language is so slow

This is a strength of the language.

If you've ever done CI/CD for a large project in a fluid development ecosystem (e.g. nodeJS), you can understand why it might be refreshing to develop your operating system in a language where standards are measured by the decade.


> but I'm curious why the pace of language is so slow. I guess it's because C has more or less stabilized and thus further revisions are less necessary?

That is a big part of it. C is an old, stable, complete language.

There is also the existence of gnu extensions, which bridge the gap between language revisions. For example, anonymous unions were added in C11 but they have been a gnu extension since forever.
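
For example, an anonymous union lets members be accessed without naming the union (hypothetical struct):

  struct message {
          int kind;
          union {                /* anonymous: no member name */
                  int ival;
                  float fval;
          };
  };

  /* with a named union you would write m.u.ival;
   * with the anonymous union it is just m.ival */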


Given that C99 is much closer in vintage to their current C89 than it is to today's date and also just straight up really old, it does seem inappropriate to distinguish that as "modern C".

That said they do seem to be discussing both C99 and C11 so it does seem like going with something that would qualify as modern (especially in light of Linux's need to be conservative) is actually on the table.


You also have to look at the contents of the revisions. C99 was a massive update for C, and C11 was tiny in comparison (and C17 was basically nothing at all).

Writing C89, without variable declaration in a for loop, and without // comments, etc, feels ancient. C99 on the other hand is what defines how C looks today.

This is also why the attitude is: if they jump to c99, they may as well jump to c11 because they are basically the same.


Why modern C when you can move to Rust?


Because we want a kernel that works today, and make improvements that can ship this year, not spend a decade rewriting working software.


https://thenewstack.io/rust-in-the-linux-kernel-good-enough/

> the objective was not to rewrite the kernel’s 25 million lines of code in Rust, but rather to augment new developments with the more memory-safe language than the standard C normally used in Linux development.


When are you going to start? Contributions are welcome.



