Pity it actually fails at the truly complex levels. Gets stuck in decision minim...

teeki · on May 23, 2019

For longer levels, I think training on later parts of the level tend to be change the policy to not do as well in the earlier parts. I suspect it would do fine on the linear levels if the number of agents and batch size was increased.

marcosdumay · on May 23, 2019

> Cannot find the symbolic notion of progress and dies of boredom?

So, it gets burnout.

sametmax · on May 23, 2019

It makes it more human :)