May be a little OT but came across this technique called MABs multi armed bandit...

mindcrime · on April 23, 2022

Not OT at all, IMO. As it happens, the linked book actually touches on MAB problems explicitly, albeit briefly. And as the book points out, you can formulate a MAB problem as specific form of Markov Decision Process, and MDP's are referenced extensively in the book.

I find MAB's incredibly interesting, and for pretty much the same reason(s) I find the rest of this stuff interesting.