Lecture No. | Date | Week | Lecturer | Topic and slides | Reading |
1 | Mon 13 January | 1 | Renals | Introduction to Speech Recognition (slides) | J&M: chapter 7, chapter 9 (9.1 - 9.3) R&H review chapter |
2 | Thu 16 January | 1 | Shimodaira | Speech Signal Analysis 1 (slides) | J&M: Sec 9.3 Taylor, chapters 10, 12 |
3 | Mon 20 January | 2 | Shimodaira | Speech Signal Analysis 2 | Hermansky (1990), PLP analysis of speech |
4 | Thu 23 January | 2 | Shimodaira | Acoustic modelling basics: HMMs and GMMs 1 (slides-4up,slides) | J&M: Secs 6.1-6.5, 9.2, 9.4 G&Y review R&H review chapter Rabiner & Juang (1986) Tutorial |
5 | Mon 27 January | 3 | Shimodaira | Acoustic modelling basics: HMMs and GMMs 2 | |
6 | Thu 30 January | 3 | Renals | Context-dependent phone modelling with HMMs 1 (slides) |
Young (2008) Lee (1990) Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition |
7 | Mon 3 February | 4 | Renals | Context-dependent phone modelling with HMMs 2 |
Young & Woodland (1994) State clustering in hidden Markov model-based continuous speech recognition Young et al (1994). Tree-based state tying for high accuracy acoustic modelling, |
Thu 6 February | 4 | Shimodaira | Introduction to Assignment 1 | Assignment 1: continuous speech recognition | |
Thu 6 February | 4 | Lab session (17:00) | |||
8 | Mon 10 February | 5 | Renals | Lexicon and language model (slides) |
J&M, Ch 4 Manning & Schutze, Ch 6 |
Mon 10 February | 5 | Lab session (17:00) | |||
9 | Thu 13 February | 5 | Shimodaira | Search and decoding (slides) | Aubert (2002) An overview of decoding techniques for large vocabulary continuous speech recognition |
Thu 13 February | 5 | Lab session (17:00) | |||
Mon 17 February | 6 | No Lecture - Innovative Learning Week | |||
Thu 20 February | 6 | No Lecture - Innovative Learning Week | |||
10 | Mon 24 February | 7 | Renals | Intro to neural networks (slides) | Multi-layer neural networks Morgan & Bourlard (1995), Continuous speech recognition: An introduction to the hybrid HMM/connectionist approach |
Mon 24 February | 7 | Lab session (17:30) | |||
Wed 26 February | 7 | Assignment 1 Deadline (16:00) | |||
11 | Thu 27 February | 7 | Renals | (Deep) neural network acoustic models (slides) | Hinton et al (2012), Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups |
Mon 3 March | 8 | Renals | Introduction to Assignment 2 | Assignment 2: literature review | |
12 | Thu 6 March | 8 | Renals | Neural network language models (slides) | Bengio et al (2006), Neural probabilistic language models(Secs 6.1, 6.2, 6.3, 6.7, 6.8) Mikolov et al (2011), Extensions of recurrent neural network language model |
13 | Mon 10 March | 9 | Renals | Speaker adaptation 1 (slides) | G&Y review, sec. 5 Woodland (2001), Speaker adaptation for continuous density HMMs: A review |
14 | Thu 13 March | 9 | Renals | Speaker adaptation 2 | |
15 | Mon 17 March | 10 | Renals | Discriminative training of GMM-based systems (slides) | Young (2008), sec 27.3.1 |
Wed 19 March | 10 | Assignment 2 Deadline (16:00) | |||
16 | Thu 20 March | 10 | Case study: transcribing TED talks (slides) |
Closer to the exam we are very happy to arrange a revision lecture at a time convenient to everyone. The point of this lecture will be to answer and discuss any questions about the course.
There are two pieces of coursework.
Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk Please contact our webadmin with any comments or corrections. Logging and Cookies Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh |