No, you can concatenate video segments at basically zero cost if the segments are encoded with the same encoding parameters, and AFAIK YouTube already re-encodes user videos.
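To show what "basically zero cost" means, here is a rough sketch using ffmpeg's concat demuxer with stream copy, which joins files without re-encoding as long as codec parameters match. This is only an illustration, not how YouTube does it; the helper names and file names are made up.

```python
from typing import List


def concat_list(paths: List[str]) -> str:
    """Build the text of a list file for ffmpeg's concat demuxer."""
    lines = []
    for p in paths:
        # Single quotes inside a quoted path are escaped as '\'' in the list file.
        escaped = p.replace("'", "'\\''")
        lines.append(f"file '{escaped}'")
    return "\n".join(lines) + "\n"


def concat_command(list_file: str, output: str) -> List[str]:
    """Build an ffmpeg argv that joins the listed files with stream copy."""
    # "-c copy" copies the encoded packets as-is, so nothing is re-encoded.
    return ["ffmpeg", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", output]


if __name__ == "__main__":
    # Hypothetical file names: an ad followed by the actual video.
    print(concat_list(["ad.mp4", "video.mp4"]), end="")
    print(" ".join(concat_command("list.txt", "out.mp4")))
```

Because the packets are only copied, the work is mostly I/O, which is why the ad and the video need matching encoding parameters.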
Simply refuse to send the original stream's packets and ignore seek requests from the client until the ad duration has passed. Sure, you could pause / mute / alt-tab for that duration, but the point is to be annoying, so that people who hate ads that much buy Premium.
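A minimal sketch of that server-side gating, assuming a segment-based stream where the ad segments come first on the timeline. The class name, fields, and segment labels are all hypothetical; a real server would track this per session and per ad break.

```python
import time
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class AdGatedSession:
    """Serves ad segments freely, content segments only after the ad time is up."""
    ad_segments: List[str]
    content_segments: List[str]
    ad_duration: float  # seconds of wall-clock time the ad must occupy
    clock: Callable[[], float] = time.monotonic  # injectable for testing
    started_at: Optional[float] = None

    def start(self) -> None:
        # Record when the server began sending the ad.
        self.started_at = self.clock()

    def ad_finished(self) -> bool:
        if self.started_at is None:
            return False
        return self.clock() - self.started_at >= self.ad_duration

    def request_segment(self, index: int) -> Optional[str]:
        """index is a position on the joined timeline: ad first, then content."""
        n_ads = len(self.ad_segments)
        if 0 <= index < n_ads:
            return self.ad_segments[index]
        if not self.ad_finished():
            # A seek past the ad is simply refused until enough time has passed.
            return None
        content_index = index - n_ads
        if 0 <= content_index < len(self.content_segments):
            return self.content_segments[content_index]
        return None
```

The check uses server-side elapsed time rather than anything the client reports, so skipping ahead in the player gets you nothing until the timer runs out.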