Note: This syllabus will be modified continuously to accommodate the progress and interests of the course participants!
Date  Topic  Handouts 
Sept. 3  Introduction to Reinforcement Learning  Slides, Sutton Book Chapters 15 
Sept. 10  Function Approximation in Reinforcement Learning, Optimal control along trajectories: LQR, LQG and DDP  Sutton Book Chapter 8, Todorov2005 
Sept. 17  Research on DDP and Function Approximation for RL  Tassa2007, Slides 
Sept. 24  Research on DDP and Function Approximation in RL  Doya2000, Morimoto2003 
Oct. 1  Gaussian Processes for Reinforcement Learning, Value function learning along trajectories (fitted Q iteration), Least Squares Temporal Difference Methods  Deisenroth2009, Lagoudakis2002, Ernst2005 
Oct. 8  Policy Gradient Methods: REINFORCE, GPOMDP, Natural Gradients  Williams1992, Sutton2000, Peters2008, Slides 
Oct. 15  Research on Policy Gradient Methods, Introduction to Path Integral Methods  Tedrake2005, Bagnell2003 
Oct. 22  Path Integral Methods for Reinforcement Learning  Theodorou2010, Todorov2009, Kober2009 
Oct. 29  Path Integral Methods for Reinforcement Learning (continued)  Slides 
Nov. 5  Sketch of Planned Projects, Modular Learning Control  Tedrake2009, Todorov2009 
Nov. 12  Inverse Reinforcement Learning  Dvijotham2009, Abbeel2009, Ratliff2009 
Nov. 19  Dynamic Bayesian Networks for Reinforcement Learning  Toussaint2006, Vlassis2009 
Dec. 3  Project Presentations  
