Just using the I$ hit ratio is problematic in many ways. E.g.:
- You'll probably not find implementations of different ISAs with identical cache configurations (size, associativity, etc.).
- It says little about what work is actually done (different ISAs = insns do different amounts of work).
- On x86 all bets are off w.r.t. the effect of the uop cache on the L1I cache hit ratio, and the uop cache hit ratio can't be compared to any other machine.
- You need to reproduce the same program flow on different architectures to be able to compare the numbers.
...etc.
I think that the only reasonable way to do it is to have a multi-ISA simulator where you are in full control of all these aspects (a toy version of the cache-model half is sketched below). And it would be really hard work.
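To make that concrete, here is a minimal sketch of what the cache-model half of such a simulator could look like, assuming you can already generate a per-ISA trace of instruction-fetch addresses (e.g. from an emulator). The geometry constants are illustrative, not a claim about any real core.

```c
/* Toy set-associative I$ model with LRU replacement.
 * Reads one instruction-fetch address per line (hex) from stdin. */
#include <inttypes.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define SETS      64   /* illustrative: 32 KiB = 64 sets * 8 ways * 64 B */
#define WAYS      8
#define LINE_BITS 6    /* 64-byte cache lines */

typedef struct {
    uint64_t tag[SETS][WAYS];
    uint64_t lru[SETS][WAYS];   /* larger stamp = more recently used */
    bool     valid[SETS][WAYS];
    uint64_t accesses, misses, tick;
} icache_t;

static void icache_access(icache_t *c, uint64_t pc)
{
    uint64_t line = pc >> LINE_BITS;
    uint64_t set  = line % SETS;
    uint64_t tag  = line / SETS;
    c->accesses++;
    c->tick++;

    for (int w = 0; w < WAYS; w++) {
        if (c->valid[set][w] && c->tag[set][w] == tag) {
            c->lru[set][w] = c->tick;   /* hit: refresh the LRU stamp */
            return;
        }
    }
    /* Miss: fill the least recently used (or first invalid) way. */
    int victim = 0;
    for (int w = 1; w < WAYS; w++)
        if (!c->valid[set][w] || c->lru[set][w] < c->lru[set][victim])
            victim = w;
    c->valid[set][victim] = true;
    c->tag[set][victim]   = tag;
    c->lru[set][victim]   = c->tick;
    c->misses++;
}

int main(void)
{
    icache_t c;
    memset(&c, 0, sizeof c);
    uint64_t pc;
    while (scanf("%" SCNx64, &pc) == 1)
        icache_access(&c, pc);
    printf("accesses=%" PRIu64 " misses=%" PRIu64 " hit ratio=%.4f\n",
           c.accesses, c.misses,
           c.accesses ? 1.0 - (double)c.misses / c.accesses : 0.0);
    return 0;
}
```

Because the model is identical for every ISA, differences in the resulting hit ratio reflect only code density and layout, which is exactly the control you can't get from real hardware.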
Re point 2: the work per instruction doesn't matter if you compare the same program/program execution; in practice you get an estimate of the code's resident set size relative to the amount of work.
All your other points do stand, and that's what I mean by 'is very machine dependent'. And yes, if you want to fully isolate the effect of instruction density, an emulator might be the only solution. Still, I think profiling counters can get you 90% of the way there (see the sketch below).
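For the counter route, something like the following is what I mean on Linux. This is a hedged sketch using perf_event_open(2), which defines generic L1I access/miss cache events; whether the hardware actually implements them varies by CPU (the syscall fails if not), and `workload()` is a placeholder for the code under test.

```c
/* Sketch: measure the L1I hit ratio of a workload via perf_event_open(2).
 * Linux-only; generic cache events may be unimplemented on some CPUs. */
#define _GNU_SOURCE
#include <linux/perf_event.h>
#include <sys/ioctl.h>
#include <sys/syscall.h>
#include <unistd.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

static int open_l1i_counter(uint64_t result)
{
    struct perf_event_attr attr;
    memset(&attr, 0, sizeof attr);
    attr.size = sizeof attr;
    attr.type = PERF_TYPE_HW_CACHE;
    /* Encoding per perf_event_open(2): id | (op << 8) | (result << 16). */
    attr.config = PERF_COUNT_HW_CACHE_L1I
                | (PERF_COUNT_HW_CACHE_OP_READ << 8)
                | (result << 16);
    attr.disabled = 1;
    attr.exclude_kernel = 1;
    return (int)syscall(SYS_perf_event_open, &attr,
                        0 /* this thread */, -1 /* any CPU */,
                        -1 /* no group */, 0);
}

static void workload(void)
{
    /* Placeholder: run the code whose I$ behaviour you want to measure. */
}

int main(void)
{
    int acc = open_l1i_counter(PERF_COUNT_HW_CACHE_RESULT_ACCESS);
    int mis = open_l1i_counter(PERF_COUNT_HW_CACHE_RESULT_MISS);
    if (acc < 0 || mis < 0) {
        perror("perf_event_open (L1I events unsupported on this CPU?)");
        return 1;
    }
    ioctl(acc, PERF_EVENT_IOC_ENABLE, 0);
    ioctl(mis, PERF_EVENT_IOC_ENABLE, 0);
    workload();
    ioctl(acc, PERF_EVENT_IOC_DISABLE, 0);
    ioctl(mis, PERF_EVENT_IOC_DISABLE, 0);

    uint64_t a = 0, m = 0;
    read(acc, &a, sizeof a);
    read(mis, &m, sizeof m);
    printf("L1I accesses=%" PRIu64 " misses=%" PRIu64 " hit ratio=%.4f\n",
           a, m, a ? 1.0 - (double)m / a : 0.0);
    return 0;
}
```

Keep the x86 caveat above in mind when reading the output: while the front end streams uops out of the uop cache, 'L1I accesses' undercounts fetch work, so the ratio isn't comparable across machines.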