DeepLoco Learns Not Only To Walk But To Kick A Ball
Written by Mike James   
Thursday, 10 August 2017

Learning to walk is hierarchical. or so DeepLoco seems to suggest. No this is not deep learning madness, but deep learning walking - and it looks good.

The thing about the task of teaching a robot to walk is that it looks easy. You just achieve balance and then place one foot in front of the other. If you have ever seen almost any video of a robot actually trying to walk you might well decide that is is a much more difficult task than you first thought.

DeepLoco, and yes I have to admit that too many things are called DeepSomething these days, is an attempt to learn to walk using a hierarchical approach. First a network learns a low-level control of the limbs to walk in a robust way. Next a higher-level network learns how to best use the lower-level abilities to navigate a terrain. It's a lot like how we do it - legs get on with the walking and we look out for the best way to go.

The networks are trained using reinforcement learning. That is, the networks are not told how to walk, just how well they are doing.

"Results are demonstrated on a simulated 3D biped. Low-level controllers are learned for a variety of motion styles and demonstrate robustness with respect to forcebased disturbances, terrain variations, and style interpolation. High-level controllers are demonstrated that are capable of following trails through terrains, dribbling a soccer ball towards a target location, and navigating through static or dynamic obstacles."

The results, presented at SIGGRAPH 2017, are fairly impressive:

 

 

If you would like to see more details of how it all works then watch the full video:

 

 

It would be good to see this, or a similar algorithm, applied to a real robot. Simulations are fine, but the real world has a way of throwing the unexpected at would-be walkers.

The research team has some ideas where to go next:

"An interesting direction would be to learn adversarial and cooperative behaviors between multiple characters,such as for playing tag or for collision avoidance when crossing paths. In the future, we also hope to be able to directly learn to drive muscle-actuated 3D models."

 football

More Information

DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning

Paper (pdf)

XUE BIN PENG and GLEN BERSETH, University of British Columbia KANGKANG YIN, National University of Singapore MICHIEL VAN DE PANNE, University of British Columbia

Related Articles

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

 

 

Banner


Google Opensources Privacy Library
08/11/2024

Google is making a new differential privacy library available as open source. PipelineDP4J is a Java-based library that can be used to analyse data sets while preserving privacy.



OpenAI Releases Swarm
25/10/2024

OpenAI has released an experimental educational framework for exploring ergonomic, lightweight multi-agent orchestration. Swarm is managed by the OpenAI Solution team, but is not intended to be used i [ ... ]


More News

 

espbook

 

Comments




or email your comment to: comments@i-programmer.info

 

 

Last Updated ( Thursday, 10 August 2017 )