Reinforcement learning was applied after the basic model was initialized with im... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

caxap on July 23, 2010 | parent | context | favorite | on: Robot needs 50 tries to learn how to flip pancakes...

Reinforcement learning was applied after the basic model was initialized with imitation. Maybe that can partially explain the small number of steps.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact