Reinforcement Learning (Level 11)
2016-17 Semester 2
Lecturer: Subramanian Ramamoorthy (s.ramamoorthy@ed.ac.uk
)
Lecture times: Tuesday and Friday 12:10 - 13:00 at
Teviot Lecture Theatre, Medical School, Doorway 5
(map).
First lecture on 17/1/2017
Lecture topics, handouts and schedule
here
Assessment:
The course mark will be computed using the following weighting:
- Homework 1: 10%
- Homework 2: 10%
- Final Exam: 80%
Readings:
- R. Sutton and A. Barto, Reinforcement Learning, MIT Press, 1998. (Required)
This book has an associated web page, including supplementary material.
- S. Thrun, W. Burgard, D. Fox, Probabilistic Robotics, MIT Press, 2006.
This book is available
in the library and in the CDT RAS library.
- D.P. Bertsekas, Dynamic Programming and Optimal Control, 2 Vols., Athena Scientific Press, 2005.
Book website
- W.B. Powell, Optimal Learning, Wiley, 2012.
Book website
- V. Krishnamurthy, Partially Observed Markov Decision Processes, Cambridge University Press, 2016.
This book is available online, through the library.
Other specific references (e.g., research papers) will be suggested along with corresponding lectures.
Admin:
Course secretary:
Ms. Alexandra Welsh
This page is maintained by the course lecturer