Main » Publications by Topic

This list is automatically created, please see publications by year in order to have a more chronological overview on my publications. Note that the list on this page is automatically generated and as such always overlapping due to overlapping keywords.

Reinforcement Learning

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

Control

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

Learning Motor Primitives

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

Robotics

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

Human Motor Control

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

Book Reviews

Record Number1773
Reference TypeReport
Author(s)Kwee, I.;Hutter, M.;Schmidhuber, J.
Year2001
TitleGradient-based reinforcement planning in policy-search methods
Keywordsreinforcement learning policy gradients model-based
NotesAn interesting paper that derives the policy gradient theorem in a differernt way for discrete worlds. Jan work is a clear superset of this. The authors achieve efficient learning by learning the model (state transition probs). All can be formluated nicely in Jan's RL framework
Place PublishedManno CH-6928, Switzerland
PublisherIDSIA
Short TitleGradient-based reinforcement planning in policy-search methods
ISBN/ISSNIDSIA-14-01
URL(s) http://www-clmc.usc.edu/publications/K/kwee-TR-IDSIA-14-01.pdf

The majority of the publications can also be obtained by Google Scholar where incomplete lists of citations are also given.


Page last modified on September 11, 2008, at 12:57 AM
Designed by: N.Ohanyan & J.Peters. Powered by PmWiki.