Saturday, May 19, 2018

OpenAI Lets Robots Learn from Hindsight

In IEEE Spectrum.

OpenAI Releases Algorithm That Helps Robots Learn from Hindsight

It's not a failure if you just pretend that you meant to do it all along

By Evan Ackerman

Being able to learn from mistakes is a powerful ability that humans (being mistake-prone) take advantage of all the time. Even if we screw something up that we’re trying to do, we probably got parts of it at least a little bit correct, and we can build on the things we got right to do better next time. Eventually, we succeed.

Robots can use similar trial-and-error techniques to learn new tasks. With reinforcement learning, a robot tries different ways of doing a thing, and gets rewarded whenever an attempt helps it to get closer to the goal. Based on the reinforcement provided by that reward, the robot tries more of those same sorts of things until it succeeds. ...
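
To make the trial-and-error loop described above concrete, here is a minimal sketch in Python. It is not OpenAI's algorithm and is not taken from the article. It is ordinary tabular Q-learning on a made-up one-dimensional track, where the "robot" is rewarded whenever a step moves it closer to the goal. The track length, the reward values, and the names `train` and `greedy_path` are all assumptions chosen for illustration.

```python
import random

ACTIONS = (-1, +1)  # step left or step right along the track


def train(n_states=6, goal=5, episodes=200, alpha=0.5, gamma=0.9,
          epsilon=0.2, seed=0):
    """Tabular Q-learning on a hypothetical 1-D track (illustrative only)."""
    rng = random.Random(seed)
    # q[state][action_index] estimates how good each action is in each state.
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        for _ in range(50):  # cap the length of each attempt
            # Trial and error: mostly repeat what has worked, sometimes explore.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            nxt = min(max(state + ACTIONS[a], 0), n_states - 1)
            # Assumed reward: +1 if the step got closer to the goal, else -1.
            reward = 1.0 if abs(goal - nxt) < abs(goal - state) else -1.0
            done = nxt == goal
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            # Nudge the estimate toward what was just observed.
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, goal=5, max_steps=20):
    """Follow the learned preferences from state 0 without exploring."""
    state, path = 0, [0]
    while state != goal and len(path) <= max_steps:
        a = 0 if q[state][0] > q[state][1] else 1
        state = min(max(state + ACTIONS[a], 0), len(q) - 1)
        path.append(state)
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

After training, the learned values favor stepping toward the goal in every state, which is the "tries more of those same sorts of things" behavior in miniature. Note that this toy rewards every helpful step; a robot that is rewarded only on full success gets far less feedback from a failed attempt.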

