A compiler I worked on actually did this for all function calls. With recursion impossible (and whole-program compilation mandatory), huge gains can be made. And that compiler is used for pretty significant applications. Odds are many posters here have its microcode in their pocket.