As far as the 64 vCPU finding, that's quite possibly because it's crossing NUMA modes. GCE's virtualization hides NUMA information unfortunately (at least as far as I've ever seen), so there's no way to handle this in software even.
Would be interesting to see these benchmarks on Haswell/Broadwell vs Skylake.
Would be interesting to see these benchmarks on Haswell/Broadwell vs Skylake.