The Eponymous Pickle: Bots Build their Own Language

Friday, March 17, 2017

Bots Build their Own Language

More on applications of reinforcement. We did something similar using a genetic driver, but no mention of that here. Would that improve learning?

It Begins: Bots are Learning to Chat in their own Language by Cade Metz In Wired:

" .... As detailed in a research paper published by OpenAI this week, Mordatch and his collaborators created a world where bots are charged with completing certain tasks, like moving themselves to a particular landmark. The world is simple, just a big white square—all of two dimensions—and the bots are colored shapes: a green, red, or blue circle. But the point of this universe is more complex. The world allows the bots to create their own language as a way collaborating, helping each other complete those tasks.

All this happens through what’s called reinforcement learning, the same fundamental technique that underpinned AlphaGo, the machine from Google’s DeepMind AI lab that cracked the ancient game of Go. Basically, the bots navigate their world through extreme trial and error, carefully keeping track of what works and what doesn’t as they reach for a reward, like arriving at a landmark. If a particular action helps them achieve that reward, they know to keep doing it. In this same way, they learn to build their own language. Telling each other where to go helps them all get places more quickly. .... "

The Eponymous Pickle

About Me

RSS

Blog Archive

Friday, March 17, 2017

Bots Build their Own Language

No comments: