| Lecture No. | Date | Week | Lecturer | Topic and slides | Reading |
| 1 | Mon 13 January | 1 | Renals | Introduction to Speech Recognition (slides) | J&M: chapter 7, chapter 9 (9.1 - 9.3) R&H review chapter |
| 2 | Thu 16 January | 1 | Shimodaira | Speech Signal Analysis 1 (slides) | J&M: Sec 9.3 Taylor, chapters 10, 12 |
| 3 | Mon 20 January | 2 | Shimodaira | Speech Signal Analysis 2 | Hermansky (1990), PLP analysis of speech |
| 4 | Thu 23 January | 2 | Shimodaira | Acoustic modelling basics: HMMs and GMMs 1 (slides-4up,slides) | J&M: Secs 6.1-6.5, 9.2, 9.4 G&Y review R&H review chapter Rabiner & Juang (1986) Tutorial |
| 5 | Mon 27 January | 3 | Shimodaira | Acoustic modelling basics: HMMs and GMMs 2 | |
| 6 | Thu 30 January | 3 | Renals | Context-dependent phone modelling with HMMs 1 (slides) |
Young (2008) Lee (1990) Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition |
| 7 | Mon 3 February | 4 | Renals | Context-dependent phone modelling with HMMs 2 |
Young & Woodland (1994) State clustering in hidden Markov model-based continuous speech recognition Young et al (1994). Tree-based state tying for high accuracy acoustic modelling, |
| Thu 6 February | 4 | Shimodaira | Introduction to Assignment 1 | Assignment 1: continuous speech recognition | |
| Thu 6 February | 4 | Lab session (17:00) | |||
| 8 | Mon 10 February | 5 | Renals | Lexicon and language model (slides) |
J&M, Ch 4 Manning & Schutze, Ch 6 |
| Mon 10 February | 5 | Lab session (17:00) | |||
| 9 | Thu 13 February | 5 | Shimodaira | Search and decoding (slides) | Aubert (2002) An overview of decoding techniques for large vocabulary continuous speech recognition |
| Thu 13 February | 5 | Lab session (17:00) | |||
| Mon 17 February | 6 | No Lecture - Innovative Learning Week | |||
| Thu 20 February | 6 | No Lecture - Innovative Learning Week | |||
| 10 | Mon 24 February | 7 | Renals | Intro to neural networks (slides) | Multi-layer neural networks Morgan & Bourlard (1995), Continuous speech recognition: An introduction to the hybrid HMM/connectionist approach |
| Mon 24 February | 7 | Lab session (17:30) | |||
| Wed 26 February | 7 | Assignment 1 Deadline (16:00) | |||
| 11 | Thu 27 February | 7 | Renals | (Deep) neural network acoustic models (slides) | Hinton et al (2012), Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups |
| Mon 3 March | 8 | Renals | Introduction to Assignment 2 | Assignment 2: literature review | |
| 12 | Thu 6 March | 8 | Renals | Neural network language models (slides) | Bengio et al (2006), Neural probabilistic language models(Secs 6.1, 6.2, 6.3, 6.7, 6.8) Mikolov et al (2011), Extensions of recurrent neural network language model |
| 13 | Mon 10 March | 9 | Renals | Speaker adaptation 1 (slides) | G&Y review, sec. 5 Woodland (2001), Speaker adaptation for continuous density HMMs: A review |
| 14 | Thu 13 March | 9 | Renals | Speaker adaptation 2 | |
| 15 | Mon 17 March | 10 | Renals | Discriminative training of GMM-based systems (slides) | Young (2008), sec 27.3.1 |
| Wed 19 March | 10 | Assignment 2 Deadline (16:00) | |||
| 16 | Thu 20 March | 10 | Case study: transcribing TED talks (slides) |
Closer to the exam we are very happy to arrange a revision lecture at a time convenient to everyone. The point of this lecture will be to answer and discuss any questions about the course.
There are two pieces of coursework.
|
Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk Please contact our webadmin with any comments or corrections. Logging and Cookies Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh |