The input stream is only half the story. For an accurate emulation you should also match the whole observable state of the machine, because some other test might actually depend on it.
That state would have to be captured with triggers on all the clocks, and you would decide by hand which clock matters in each case.
To get that, you would need a set of hardware debuggers plugged into the bus and the chips, and a lot of inside knowledge to decide whether a deviation is random enough that it does not have to be emulated.
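To make the idea concrete, here is a rough sketch of what comparing a captured hardware trace against an emulator trace could look like. Everything in it is an assumption for illustration: the `Sample` record, the `first_mismatch` helper, the signal names, and the `dont_care` masks (which stand in for the hand-made decisions about which deviations are random enough to ignore) are all hypothetical and not taken from any real debugger or emulator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """One capture point (hypothetical format, not from a real tool)."""
    clock: str   # name of the clock whose edge triggered the capture
    cycle: int   # cycle count on that clock
    state: dict  # signal name -> observed integer value


def first_mismatch(hardware, emulator, dont_care=None):
    """Return (index, signal, hw_value, emu_value) for the first
    difference between the two traces, or None if they agree.

    dont_care maps a signal name to a bit mask of bits that were judged
    (by hand) to be random on real hardware and so are not compared.
    """
    dont_care = dont_care or {}
    for i, (hw, emu) in enumerate(zip(hardware, emulator)):
        # The capture points themselves must line up first.
        if (hw.clock, hw.cycle) != (emu.clock, emu.cycle):
            return (i, "<trigger>", (hw.clock, hw.cycle), (emu.clock, emu.cycle))
        for signal, hw_value in hw.state.items():
            keep = ~dont_care.get(signal, 0)  # bits that must match
            emu_value = emu.state.get(signal)
            if emu_value is None or (hw_value & keep) != (emu_value & keep):
                return (i, signal, hw_value, emu_value)
    if len(hardware) != len(emulator):
        # One trace ended early; report where the shorter one stops.
        return (min(len(hardware), len(emulator)), "<length>",
                len(hardware), len(emulator))
    return None


hw_trace = [Sample("cpu", 0, {"data_bus": 0xA5}),
            Sample("cpu", 1, {"data_bus": 0x3C})]
emu_trace = [Sample("cpu", 0, {"data_bus": 0xA5}),
             Sample("cpu", 1, {"data_bus": 0x3F})]

strict = first_mismatch(hw_trace, emu_trace)
relaxed = first_mismatch(hw_trace, emu_trace, dont_care={"data_bus": 0x03})
```

With the strict comparison the second sample is reported as a mismatch; once the low two bits of the hypothetical `data_bus` signal are declared random, the traces agree. The hard part in practice is not this loop but obtaining the hardware trace and filling in the masks correctly.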