Reinforcement Learning 2016/2017

Typically, lecture slides will be added/updated one day before the lecture. Lectures will be held between 12:10 - 13:00 in Teviot Lecture Theatre, Medical School, Doorway 5 on Tuesdays and same time same place on Fridays.
Basic Mathematical Background: Please review this cribsheet to make sure you understand the concepts therein. You may also find these resources useful as occasional reference material.
On Using Matlab: Take a look at this handout Introduction to MATLAB giving an introduction to MATLAB (you may ignore the section about NETLAB). A further MATLAB tutorial is available at MTU Introduction to Matlab.
Note that the coursework will also require other tools and programming environments, which will be introduced and explained in lectures.

Lecture content:
Assignments and Deadlines:
January 17, 2017
Slides (pdf)
Reading: Ch 1 of Sutton & Barto book
January 20, 2017
Multi-armed Bandits; Review of Markov Chains; Introduction to Markov Decision Processes
Slides (pdf)
Reading: Ch 2, 3 of Sutton & Barto book
January 24, 2017
Dynamic Programming
January 27, 2017
Monte Carlo methods, On-policy and Off-policy Control
January 31, 2017
Temporal Difference Methods, Q-Learning
February 3, 2017
Value Function Approximation
Course Assignment 1 (To be Announced)
February 7, 2017
[Tutorial] Worked Examples and Q+A regarding Assignment
February 10, 2017
Policy Improvement and Optimization Methods
February 14, 2017
[Tutorial] Introduction to the Arcade Learning Environment (ALE)
Reference: ALE Website
February 17, 2017

Assignment 1 Due (4 pm, submit electronically and hand in hardcopy to ITO)
February 28, 2017
Abstraction: Options and Hierarchy
Course Assignment 2 (To be Announced)
March 3, 2017
Deep Reinforcement Learning I
March 7, 2017
Deep Reinforcement Learning II
March 10, 2017
Partial Obserability and the Partially Observed Markov Decision Process
March 14, 2017
POMDPs Contd.
February 17, 2017
[Tutorial] Discussion and tools Assignment 2
March 21, 2017
Controlled Sensing and Exploration
March 24, 2017
Controlled Sensing and Exploration Contd.
March 28, 2017
Inverse Reinforcement Learning and Transfer Learning
Assignment 2 Due (4 pm, submit electronically and hand in hardcopy to ITO)
March 31, 2017
Monte-Carlo Tree Search and Game Playing
April 4, 2017
Multi-agent Reinforcement Learning
April 7, 2017
[Tutorial] Q+A and Review for Exam

RL Home

Home : Teaching : Courses : Rl 

Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail:
Please contact our webadmin with any comments or corrections. Logging and Cookies
Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh