Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you just want calls and returns, can't you use one of the other PMUs for that? Or is sampling at the "1 sample per event" level higher overhead than IPT?


do you mean configuring the other PMUs to interrupt the core every function call / return?

If yes, then yes that is much much higher overhead than processor trace.


It's worth noting that aside from the overhead, function call / returns are not quite enough to reconstruct the callstack: tailcalls are just regular branch instructions.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: