DeepLoco Learns Not Only To Walk But To Kick A Ball

Written by Mike James

Thursday, 10 August 2017

Learning to walk is hierarchical. or so DeepLoco seems to suggest. No this is not deep learning madness, but deep learning walking - and it looks good.

The thing about the task of teaching a robot to walk is that it looks easy. You just achieve balance and then place one foot in front of the other. If you have ever seen almost any video of a robot actually trying to walk you might well decide that is is a much more difficult task than you first thought.

DeepLoco, and yes I have to admit that too many things are called DeepSomething these days, is an attempt to learn to walk using a hierarchical approach. First a network learns a low-level control of the limbs to walk in a robust way. Next a higher-level network learns how to best use the lower-level abilities to navigate a terrain. It's a lot like how we do it - legs get on with the walking and we look out for the best way to go.

The networks are trained using reinforcement learning. That is, the networks are not told how to walk, just how well they are doing.

"Results are demonstrated on a simulated 3D biped. Low-level controllers are learned for a variety of motion styles and demonstrate robustness with respect to forcebased disturbances, terrain variations, and style interpolation. High-level controllers are demonstrated that are capable of following trails through terrains, dribbling a soccer ball towards a target location, and navigating through static or dynamic obstacles."

The results, presented at SIGGRAPH 2017, are fairly impressive:

If you would like to see more details of how it all works then watch the full video:

It would be good to see this, or a similar algorithm, applied to a real robot. Simulations are fine, but the real world has a way of throwing the unexpected at would-be walkers.

The research team has some ideas where to go next:

"An interesting direction would be to learn adversarial and cooperative behaviors between multiple characters,such as for playing tag or for collision avoidance when crossing paths. In the future, we also hope to be able to directly learn to drive muscle-actuated 3D models."

football

More Information

DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning

Paper (pdf)

XUE BIN PENG and GLEN BERSETH, University of British Columbia KANGKANG YIN, National University of Singapore MICHIEL VAN DE PANNE, University of British Columbia

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Camunda Adds Agentic Orchestration
08/04/2025

Camunda has announced new agentic orchestration capabilities that companies can use to model, deploy, and manage AI agents in their end-to-end processes. Camunda is also introducing AI-powered feature [ ... ]

+ Full Story

Linkerd 2.18 Adds Protocol Declarations
23/04/2025

Version 2.18 of the Linkerd service mesh has added features aimed at making the software better at handling problematic situations, along with an experimental build of the proxy for Windows environmen [ ... ]

+ Full Story

More News

Comments

or email your comment to: comments@i-programmer.info

Last Updated ( Thursday, 10 August 2017 )

More Information

Related Articles

Comments