Automatic Speech Recognition (ASR): 2012/13

[course descriptor]

Lecturer

News

Syllabus

  1. Introduction to Speech Recognition (slides) [Lecture 1: 14 January 2013]
  2. Speech signal analysis (slides) [Lectures 2, 3: 18, 25 January 2013]
  3. Acoustic modelling basics: HMMs and GMMs (slides) [Lectures 4, 5: 28, 31 January 2013]
  4. Context-dependent phone modelling with HMMs (slides) [Lectures 6, 7: 4, 7 February 2013]
  5. Lexicon and language model (slides) [Lecture 8: 11 February 2013]
  6. Search and decoding (slides) [Lecture 9: 14 February 2013]
  7. Speaker adaptation (slides) [Lectures 10, 11: 25, 28 February 2013]
  8. Robustness to the acoustic environment (slides) [Lecture 12: 4 March 2013]
  9. Discriminative training of GMM-based systems (slides) [Lecture 13: 11 March 2013]
  10. (Deep) neural networks (slides) [Lectures 14, 15: 11, 18 March 2013]
  11. Case study: transcribing TED talks (slides) [Lecture 16: 21 March 2013]

Readings: Useful texts

Schedule

Closer to the exam I am very happy to arrange a revision lecture at a time convenient to everyone. The point of this lecture will be to answer and discuss any questions about the course.

Coursework

The coursework will involve training and testing a continuous speech recognition system using the HTK software. We'll use the WSJCAM0 database (British English recordings of speakers reading the Wall Street Journal sentences). It will come in two parts.


 
 
 


Home : Teaching : Courses : Asr 

Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk
Please contact our webadmin with any comments or corrections. Logging and Cookies
Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh