Basic Mathematical Background: Please review this cribsheet to make sure you understand the concepts therein. You may also find these resources useful as occasional reference material.
On Using Matlab: Take a look at this handout Introduction to MATLAB giving an introduction to MATLAB (you may ignore the section about NETLAB). Further MATLAB tutorials are available at UNH Matlab Tutorial, US Navy Matlab Tutorial and MTU Introduction to Matlab.
| Date: |
Lecture content: |
|
| January 15 |
Introduction Slides (pdf) Reading: Ch 1 of Sutton & Barto book |
|
| January 18 |
Multi-Armed Bandits Slides (pdf) Reading: Ch 2 of Sutton & Barto book |
|
| January 22 |
A phenomenological account of Q-learning Slides (pdf) Reading: Ch 2 of Sutton & Barto book |
|
| January 25 |
Formulation of the Reinforcement Learning problem - Markov Decision Processes Slides (pdf) Reading: Ch 3 of Sutton & Barto book Optional Reading: If you are curious about the Markov Chain portion and want to know more, there are many textbook type references, e.g., Ch 1 of a book by J. Norris. |
|
| January 29 |
Value functions and the Bellman equation Slides (pdf) Reading: Ch 1 of Szepesvari book |
|
| February 1 |
Value Prediction Slides (pdf) Reading: Ch 2.1 of Szepesvari book |
|
| February 5 |
Reinforcement learning: Examples
Learning Slides (pdf) Reading: see slides for reading suggestions | |
| February 8 |
Biological Reinforcement learning Slides (pdf) Reading: see slides for reading suggestions | Homework 1 assigned |
| February 12 |
POMDP Slides (pdf) Chapters 15 and 16 of S. Thrun, W. Burgard, D. Fox, Probabilistic Robotics, MIT Press. |
|
| February 15 |
POMDP Slides (pdf) |
|
| February 26 |
Hierarchical RL 1 Slides (pdf) Literature on last slide. |
|
| March 1 |
Hierarchical RL 2 Slides (pdf) Literature on last slide. |
|
| March 5 |
Large State Spaces Slides (pdf) Based on C. Szepesvari: Algorithms for RL, Chapter 2.2 | |
| March 7 |
Deadline has been extended by one week (because of ILW)
|
Homework 1 due (Deadline: 16:00) |
| March 8 |
Policy gradient methods (Actor-critic) Slides (pdf) |
Homework 2 assigned |
| March 12 |
Natural Actor Critic Slides (pdf) |
|
| March 15 |
NO LECTURE |
|
| March 19 |
Particle swarms and Further topics in RL Slides (pdf) |
|
| March 22 |
Efficient representations for RL Slides (pdf) |
|
| March 26 |
RL in neural networks (bonus lecture) Slides (pdf) |
|
| March 28 |
Homework 2 due (Deadline: 16:00) |
|
Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk Please contact our webadmin with any comments or corrections. Logging and Cookies Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh |