This may be a little OT, but I came across a technique called multi-armed bandits (MABs), which is about finding optimal ways to choose among a sequence of actions while measuring their utility.
It has applications in a wide variety of fields, such as the optimal design of clinical trials and public policy decision making.
Apparently it also has some relation to the work that Abhijit Banerjee and Esther Duflo, the Nobel Prize-winning economist couple, have done on the design of experiments to measure the impact of developmental interventions.
Not OT at all, IMO. As it happens, the linked book touches on MAB problems explicitly, albeit briefly. And as the book points out, you can formulate a MAB problem as a specific form of Markov Decision Process, and MDPs are referenced extensively in the book.
I find MABs incredibly interesting, for pretty much the same reasons I find the rest of this stuff interesting.
https://en.m.wikipedia.org/wiki/Multi-armed_bandit
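For anyone who wants a concrete feel for the problem, here is a minimal sketch of epsilon-greedy, one of the simplest bandit strategies. It is not taken from the linked book or from this thread. The function name `epsilon_greedy`, its parameters, and the payout probabilities are all made up for illustration, and the rewards are assumed to be simple 0/1 (Bernoulli) outcomes.

```python
import random

def epsilon_greedy(true_probs, steps=5000, epsilon=0.1, seed=0):
    """Simulate epsilon-greedy on a Bernoulli bandit (illustrative only).

    true_probs: hypothetical payout probability of each arm. The agent
    never reads these directly; they are only used to draw rewards.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best estimate so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        # Incremental update of the running mean for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = epsilon_greedy([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated payouts:", [round(e, 2) for e in estimates])
    print("total reward:", total)
```

The `epsilon` parameter is the exploration/exploitation trade-off in its crudest form: a small fraction of pulls goes to random arms so the estimates keep improving, and the rest go to whichever arm currently looks best.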