Worth trying, especially with those slow stuttering drones on youtube, and as the strangeattractor says you can always rewind selected parts as needed.
The idea of varied speed viewing has been well known in times past for video as well as audio. There are also the gap compressors that are useful in audio streams that serve the words with variably reduced word gaps. I do wish they also had a stutter, um, ahh stripper for audio that have not used post production editing to eliminate gaps/ums. ahs and stutters etc.
There are a number of training courses that help you in becoming an audio reader that many youtubers should well have a look at.
https://www.google.com/search?q=audio+reader+training&rlz=1C...