It wouldn't be decoded on the CPU but on the GPU, or even on specialized hardware. As convnets are used more in image processing, that isn't too unrealistic; there is interest in making specialized consumer hardware for convnets.
And it doesn't need to work on every frame; it could pump out an I-frame every 15 seconds or so.
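To put a rough number on the "every 15 seconds" idea, here is a back-of-the-envelope sketch. The 30 fps frame rate is my assumption, not something fixed by the scheme, and `convnet_frame_fraction` is just a hypothetical helper name.

```python
def convnet_frame_fraction(fps: float, keyframe_interval_s: float) -> float:
    """Fraction of all frames that are I-frames when one I-frame
    is emitted every `keyframe_interval_s` seconds."""
    frames_per_interval = fps * keyframe_interval_s
    return 1.0 / frames_per_interval


# Assumption: 30 fps video. The 15-second interval is the figure from the comment.
fraction = convnet_frame_fraction(fps=30, keyframe_interval_s=15)
print(f"1 in {round(1 / fraction)} frames would go through the convnet")
```

Under those assumptions only 1 frame in 450 would need the expensive decode, with the rest handled by ordinary inter-frame prediction.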