I would presume you'd want to have that explore option use different algorithms. When in the exploration situation you still want to narrow down what kinds of topics people want to find and not just a random assortment. You're always going to have to "read people's minds" to some extent because there's too much information and too many options out there. You should probably increase this exploration value as the user continues to scroll.
Also, if someone is scrolling and scrolling you'd probably want to increase your exploration hyperparameter as time goes by. So thus we're pretty much automating what is done above by user behavior. You don't really need to read someone's mind because they are already explicitly telling you that what you've presented to them isn't what they are looking for (otherwise they would have clicked it by now).
Though, for convenience I think you could find clear patterns in certain behaviors. For example, I'm one of those people that like to fall asleep to a show that I've seen a few times already. It is pretty easy to predict that. You can sure predict that if I'm opening Netflix late at night that I want to watch Star Trek.
Also, if someone is scrolling and scrolling you'd probably want to increase your exploration hyperparameter as time goes by. So thus we're pretty much automating what is done above by user behavior. You don't really need to read someone's mind because they are already explicitly telling you that what you've presented to them isn't what they are looking for (otherwise they would have clicked it by now).
Though, for convenience I think you could find clear patterns in certain behaviors. For example, I'm one of those people that like to fall asleep to a show that I've seen a few times already. It is pretty easy to predict that. You can sure predict that if I'm opening Netflix late at night that I want to watch Star Trek.