Hacker News new | past | comments | ask | show | jobs | submit login

May be a little OT but came across this technique called MABs multi armed bandits , related to , finding optimal ways to decide among a sequence of actions and measuring their utility

https://en.m.wikipedia.org/wiki/Multi-armed_bandit

Has applications in a wide variety of fields : optimal design of clinical trials, public policy decision making.

apparently has some relation to the work abhijit banerjee & esther duflo, the nobel prize winning economist couple have done on design of experiments, to measure impact of developmental interventions




Not OT at all, IMO. As it happens, the linked book actually touches on MAB problems explicitly, albeit briefly. And as the book points out, you can formulate a MAB problem as specific form of Markov Decision Process, and MDP's are referenced extensively in the book.

I find MAB's incredibly interesting, and for pretty much the same reason(s) I find the rest of this stuff interesting.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: