I thought the plan was to have common code for the generic, non-critical parts like graph handling, while the operators would ship in multiple variants, selected and loaded at runtime based on CPUID. You can even delegate that selection to glibc if the variants are packaged as shared libraries and the glibc version is recent enough. Maybe in 1.7?
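To make the idea concrete, here's a rough sketch of manual dispatch in C. It is not how the project actually does it. The names `vec_add`, `select_add`, `add_generic` and `add_avx2_variant` are made up for illustration, and the "AVX2" variant is just a plain loop standing in for a kernel that would really be compiled in its own translation unit with `-mavx2`. It relies on the GCC/Clang builtins `__builtin_cpu_init` and `__builtin_cpu_supports`, which only exist on x86, so everything else falls back to the generic path.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical operator signature: element-wise float addition. */
typedef void (*add_fn)(const float *a, const float *b, float *out, size_t n);

/* Baseline variant that runs on any CPU. */
static void add_generic(const float *a, const float *b, float *out, size_t n) {
    for (size_t i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}

/* Placeholder for an AVX2 build of the same operator. In a real project
 * this would live in a separate file compiled with -mavx2; here it is the
 * same scalar loop so the sketch builds everywhere. */
static void add_avx2_variant(const float *a, const float *b, float *out, size_t n) {
    for (size_t i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}

/* Pick a variant based on what the running CPU reports via CPUID. */
add_fn select_add(void) {
#if (defined(__x86_64__) || defined(__i386__)) && defined(__GNUC__)
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx2"))
        return add_avx2_variant;
#endif
    return add_generic;
}

/* Public entry point: resolve once on first call, then reuse the pointer.
 * (Not thread-safe as written; a real implementation would guard this.) */
void vec_add(const float *a, const float *b, float *out, size_t n) {
    static add_fn impl = NULL;
    if (impl == NULL)
        impl = select_add();
    assert(impl != NULL);
    impl(a, b, out, n);
}
```

Callers only ever see `vec_add`, so the generic code (graph handling etc.) doesn't need to know which variant got picked. The glibc route I mentioned replaces `select_add` entirely: as far as I know, from glibc 2.33 on the dynamic loader looks in `glibc-hwcaps/<level>` subdirectories (e.g. `x86-64-v3`) of the library search path and picks the best matching build of a shared library by itself.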