Site Search   view print login

Resources » Publications

Record Number10305
Reference TypeConference Proceedings
Author(s)Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal
Yearsubmitted
TitleReinforcement Learning of Motor Skills in High Dimensions: A Pah Integral Approach. (MANUSCRIPT UNDER REVIEW. SUGGESTIONS WELCOME))
Keywordsreinforcement learning, stochastic optimal control

Abstract

Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far due to the computational difficulties that reinforcement learning encounters in high dimensionsal continuous state-action spaces. In this paper, we derive a novel approach to RL for parameterized control policies based on the framework of stochastic optimal control with path integrals. While solidly grounded in optimal control theory and estimation theory, the update equations for learning are surprisingly simple and have no danger of numerical instabilites as neither matrix inversions nor gradient learning rates are required. Empirical evaluations demonstrate significant performance improvements over gradient-based policy learning and scalability to high-dimensional control problems. Finally, a learning experiment on a robot dog illustrates the functionality of our algorithm in a real-world scenario. We believe that our new algorithm, {\bf P}olicy {\bf I}mprovement with {\bf P}ath {\bf I}ntegrals (${\bf PI}^2$), offers currently one of the most efficient, numerically robust, and easy to implement algorithms for RL in robotics.
Notesclmc
Link to PDFhttp://www-clmc.usc.edu/publications//E/PI2.pdf

Papers are available as Adobe PDF ".pdf" files. Adobe Reader is available for free for all computer platforms.
Page last modified on August 10, 2006, at 06:47 PM