Tentative Class Schedule


<
CSE 6369
Reinforcement Learning
Tentative Lecture and Assignment Schedule
Spring Semester 2014 - TTh 2:00 - 3:20
Class Date Readings Lecture Topics Assignments
1 01/14 1.1 - 1.6, Slides Course Overview and Introduction  
2 01/16 2.1 - 2.4, Slides Background - Evaluative Feedback  
3 01/21 2.5 - 2.10 Background - Utility and Decision Theory  
4 01/23 3.1 - 3.10, Slides Background - Probabilistic models and MDPs  
5 01/28 4.4, 4.7 Dynamic Programming Methods - Value Iteration  
6 01/30 5.1 - 5.8, Slides Monte Carlo Methods  
7 02/04 6.1 - 6.3 Temporal Difference Learning  
8 02/06 6.6 - 6.9 Actor-Critic Models  
9 02/11 4.1 - 4.6 Dynamic Programming Methods - Policy Iteration  
10 02/13 6.4 - 6.6 Value Iteration Methods  
11 02/18   On-Policy vs Off-Policy Learning Homework 1 due
12 02/20 7.1 - 7.6, Slides Value Updates  
13 02/25 8.1 - 8.3 Function Approximation in Reinforcement Learning  
14 02/27 8.4 - 8.7 Function Approximation in Reinforcement Learning  
15 03/04   Exploration vs. Exploitation Tradeoff  
16 03/06 Slides Policy Improvement and Policy Gradient  
  03/11   Spring Break - No Class  
  03/13   Spring Break - No Class  
17 03/18 Sutton NIPS 99 Paper  
18 03/20 Slides Policy Improvement and Policy Gradient Project 1 due (Simulation Code: 64 Bit Linux, OSX, Cygwin 32 bit graphics library)
19 03/25 9.1 - 9.3 Model-Based Learning  
20 03/27 9.4 - 9.6 Efficient Model-Based Learning
21 04/01   Efficient Model-Based Learning  
22 04/03   Learning in Partially Observable Systems  
23 04/08 Paper Learning in Partially Observable Systems  
24 04/10   Learning in Partially Observable Systems  
25 04/15   Learning in Partially Observable Systems Homework 2 due
26 04/17   Learning in Partially Observable Systems  
27 04/22   Hierarchical Reinforcement Learning  
28 04/24  Hierarchical Reinforcement Learning  
29 04/29   Student Presentations  
30 05/01   Student Presentations & Current Challenges due
31 05/06   Final Project Presentations (2:00pm-4:30pm)



Footnotes

...
All information is tentative and subject to change.


Manfred Huber
2014-01-14