Hacker News new | past | comments | ask | show | jobs | submit login
Deep Successor Reinforcement Learning (2016) (arxiv.org)
69 points by aaronjg on Feb 7, 2017 | hide | past | favorite | 2 comments



It's not clear to me how this is interestingly different from model-based RL, where you learn the state function and reward function, and then use various types of simulation to learn a value function. I guess I'll have to read more than the abstract...


Section 3.2 shows the successor representation (SR) definition. If I'm reading it correctly the SR might also be described as the discounted stationary distribution over states.

I haven't seen SR before in the RL literature, but the paper argues that this representation is useful for sub-goal identification. I guess I'll have to read more than the abstract as well :)




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: