There are hundreds of reasons why this wouldn't work, mainly the fact that almost nobody watches the same ads, meaning you're just rendering 100,000+ different versions of the same video depending on who watches it.
You don’t need to render the whole video, just the frames between the last keyframe of the content and the first keyframe of the ad. The rest can just be cached as is.