> I know that cross-compiling Linux on an Intel X86 CPU isn't necessarily going to be as fast as compiling on an ARM64-native M1 to begin with
Is that true? If so, why? (I don't cross compile much, so it isn't something I've paid attention to).
The architecture the compiler is running on doesn't change what the compiler is doing. It's not like the fact that it's running on ARM64 gives it some special powers to suddenly compile ARM64 instructions better. It's the same compiler code doing the same things and giving the same exact output.
Some cross-compilation may need some emulation to fold constant expressions. For example, if you write code using 80-bit floats for x86 and cross-compile on a platform that doesn't have them, they must be emulated in software. The cost of this feels small, but it gets more expensive if regular double-precision floating-point arithmetic is also emulated when cross-compiling. Obviously some programs have more constant folding to do during compilation than others.
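As a rough sketch of what that looks like (assuming an x86 target where long double is the 80-bit x87 extended format; the constant below is purely illustrative): the initializer is a constant expression, so it has to be folded at build time, and a compiler hosted on ARM64 has no x87 hardware to fold it with, so it does the arithmetic in a software float library (GCC folds through MPFR, LLVM through APFloat).

    /* Illustrative only: an x86 target where long double is the 80-bit
     * x87 extended format.  The initializer is a constant expression,
     * so the compiler folds it at build time.  A compiler hosted on
     * ARM64 has no x87 unit and must do that folding in software. */
    #include <stdio.h>

    static const long double k = (1.0L / 3.0L) * 3.0L - 1.0L;

    int main(void)
    {
        printf("%.21Lg (sizeof long double = %zu)\n", k, sizeof(long double));
        return 0;
    }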
My understanding is that LLVM already does software emulation of floating point for const evaluation, in order to eliminate any variation due to the host architecture.
Is constant folding going to be a bottleneck? In this particular instance, in the kernel, floating point is going to be fairly rare anyway, and integer constant folding is going to be more or less identical on 64-bit x86 and ARM.
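For a sense of what most of the kernel's constant folding actually is, here's a minimal sketch (the macro names mirror kernel conventions but aren't pulled from the real headers): it's plain 64-bit integer arithmetic, which an x86-64 host and an ARM64 host fold to exactly the same values.

    /* Kernel-style integer constant folding.  Everything here is plain
     * 64-bit integer arithmetic, so an x86-64 host and an ARM64 host
     * fold it identically.  Macro names are illustrative, not copied
     * from the actual kernel headers. */
    #include <stdio.h>

    #define PAGE_SHIFT    12
    #define PAGE_SIZE     (1UL << PAGE_SHIFT)
    #define PAGE_MASK     (~(PAGE_SIZE - 1))
    #define PAGE_ALIGN(x) (((x) + PAGE_SIZE - 1) & PAGE_MASK)

    int main(void)
    {
        /* Folded to 16384 at compile time on either host. */
        printf("%lu\n", PAGE_ALIGN(12345UL));
        return 0;
    }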
In theory, yeah. In practice, a native compiler may have a slightly different target configuration than a cross compiler. For example, a cross compiler may default to soft float, while a native compiler would use hard float if the system it's built on supports it. Basically, ./configure --cross=arm doesn't always produce the same compiler that you get by running ./configure on an arm system. As a measurable difference it's probably pretty far into the weeds, but benchmarks can be oddly sensitive to such differences.
There's no reason for a cross-compiler to be slower than a native compiler.
If your compiler binary is built for architecture A and emits code for architecture B, it's going to perform about the same as a compiler that's built for architecture A and emits code for architecture A.
Well, there's one: if people tend to compile natively much more often than they cross-compile, then it would make sense to spend optimization time on what benefits users.
Yes, but you would probably make those optimizations in C code, not assembly. The amd64 compiler is basically the same C code whether it's been bootstrapped on armv8 or amd64.
Well, to get a little nuanced, it depends on whether the backend for B is doing roughly the same stuff as the backend for A (e.g., the same optimizations). I have no idea whether that's generally true or not.