
Pity it actually fails at the truly complex levels. Gets stuck in decision minima due to lack of memory?

Or is the reward too sparse?

Cannot find the symbolic notion of progress and dies of boredom?



For longer levels, I think training on later parts of the level tends to change the policy so that it does not do as well in the earlier parts. I suspect it would do fine on the linear levels if the number of agents and the batch size were increased.


> Cannot find the symbolic notion of progress and dies of boredom?

So, it gets burnout.


It makes it more human :)



