The Sarcos Dexterous Arm learns the skill of pole balancing and a pendulum-swing-up from a demonstration of a teacher. All appled learning methods involve model-based reinforcement learning.
Using a novel methods of encoding movement primitives as nonlinear dynamical systems, the humanoid robot DB is taught how to perform tennis forehand and backhand swings. Note that in imitation, the movement is performed towards the colored ball, i.e., the movement primitive is generalized to new targets.
As in the previous movie, dynamical systems are used to encode movement primitives, just that we show how to encode rhythmic behavior this time. Note that the learned primitives can be sped up easily, can be changed in amplitude, and can be made robust towards external perturbations. The latter point is demonstrated by manually interfering with the robot's movement, which leads to a temporary abortion of the movement pattern. After the robot is released, it continues the pattern where it was stopped -- i.e., the robot planning was gracefully interupted.
Designed by: Nerses Ohanyan & Jan Peters
Page last modified on October 03, 2011, at 11:19 PM