Implementing fork() on the Mill CPU (2014)

JoeAltmaier · on March 9, 2015

Not just fork, also shared libraries of any kind? Any time multiple processes share the same virtual addresses. Because shared TLB.

marcosdumay · on March 9, 2015

Shared libraries are easier. Both processes just use the same memory.

The problem here is page reusing with copy-on-write. I have no idea how one'd implement that.

ChuckMcM · on March 9, 2015

This is what I was thinking as well. Single address space OSes are actually quite powerful, and fork is 'trivial' if you have the correct page protection mechanisms. Basically a fork consists of starting a new process context, and each page that gets modified, gets a new page where internal pointers in the page are updated and external pointers are left alone. The trick there is managing a list of relocatable references for every page with tags of 'internal' or 'external' to facilitate fixups.

JoshTriplett · on March 9, 2015

In particular, a single address space means that if you copy a page on write, you also have to copy any page that points to it, so that you can fix up pointers to it. And that's not transparent to userspace. With virtual address spaces, you can copy a page in physical memory without changing its virtual address.

Then again, many of the same considerations apply when using ASLR.

comex · on March 9, 2015

In practice, it'd be quite different because with ASLR, the compiler records the locations of all address references in the binary, so the OS knows what to fix up. At runtime, though, C programs normally leave no indication in memory of what is a pointer and what is pure data; even if you changed the compiler to emit this and the allocator to track it, you'd have problems with fairly common constructions like custom allocators, unions where it may be nontrivial to determine which alternative is in use (especially since the data may actually be uninitialized), tagged pointers, hashes based on the pointer value, et cetera. Garbage collectors for C run into the same problems and forbid some of those constructions, but they can always fall back on not collecting an allocation if they're unsure whether a reference to it is a real pointer or just an integer with the same value. If you're actually relocating things, you can't risk accidentally changing the value of some integer.

Honestly, I don't expect such a scheme to be implemented, considering how complicated (and limited) it would be. It sounds more realistic to implement a scheme that implements the full semantics even if at a severe performance cost, as suggested in the article, and expect ported Mill programs to stop using fork without exec for anything important.

anon4 · on March 9, 2015

You could drop the copy-on-write mechanism and just use copy-always for fork and have a base register which offsets all pointer access (analogous to the mostly disused SS register on x86). There could be some mechanism to detect fork-then-exec for performance.

marcosdumay · on March 10, 2015

Yep. You need some mechanism for running several times the same code anyway. Anything that solves that, solves the problem of forking without copy-on-write.

The memory model is one of the features of the Mill that I'm most curious about (the other one being access control).

JoeAltmaier · on March 9, 2015

I can guess: on kernel process switch, rewrite the TLB entry 'manually'. I used to have to do that on old 8086 OS for software interrupt vectors (because no TLB).

TazeTSchnitzel · on March 9, 2015

Shared libraries use portals, IIRC.

gsg · on March 9, 2015

For mutable data - text or rodata pages sharing the same addresses should be fine (if I understand the scheme correctly).

tinco · on March 10, 2015

Probably being way ahead of things, but is the instruction set defined at this point? I would love to play around on an emulator. I browsed around their website a bit, but saw only youtube videos explaining general information, same for the white paper.

peterfirefly · on March 10, 2015

The Wiki on their page contains more information, including this:

http://millcomputing.com/wiki/Instruction_Set

http://millcomputing.com/instructions.html

http://millcomputing.com/wiki/Architecture

The comp.arch newsgroup and the realwordtech web site contain have plenty of discussion of the Mill. Search for 'Ivan Godard' or 'igodard'.

http://www.realworldtech.com/forum/?threadid=147612&curposti...

http://www.realworldtech.com/forum/?threadid=147482&curposti...

http://www.realworldtech.com/forum/?threadid=147185&curposti...

Some details of the architecture have not been revealed yet due to patents having to be filed for first, so the public information is incomplete in places. Other things have been changed recently. Threading and IPC is an example of the former and the way vectors work is an example of the latter (and maybe also the former). Both are scheduled to be revealed April 10 in Amsterdam:

http://awesomeit.nl/

Some of the encoding details are also not finalized yet or they might be very incompletely described. The basic shape of the beast is there, though.

There is no public emulator but there is almost enough information that you can at least start writing your own for fun.

white-flame · on March 10, 2015

The machine instruction set is generated from the CPU generating tools per family member.

The lowest level you'd want to get in terms of hand-writing "assembly" would be genAsm, http://millcomputing.com/wiki/GenAsm_%28language%29 , but there are no released emulators or simulators yet.

genAsm is a pretty generic language, and the OS's specializer will compile that to the architecture-specific machine instructions.