ASR 2018-19  |  News Archive  |  Lectures  |  Labs  |  Coursework  |  Piazza

Lecture 13 - Speaker adaptation

The aim of speaker adaptation is to use a small amount of speaker-specific data to adapt a speaker-independent speech recognition system such that is tuned to the speech of a specific speaker. One could imagine adaptation approaches at different levels - language model adaptation, pronunciation model adaptation, acoustic model adaptation. Although pronunciation model adaptation is a compelling idea, and has been explored, there hasn't (yet?) been a method in which has had consistent success. Language model adaptation has been more focused on domain of use and topic, and has had some success. This lecture focuses on acoustic model adaptation: there has been a lot of work in this area, and several successful, heavily used techniques.

Woodland's review article from 2001 still gives good coverage of adapatation for HMM/GMM systems. Swietojanski et al's LHUC paper is a good paper about adapting DNN based systems. Also Saon et al's paper on i-vector based adaptation.


Copyright (c) University of Edinburgh 2015-2019
The ASR course material is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License
licence.txt
This page maintained by Steve Renals.
Last updated: 2019/04/23 17:41:15UTC


Home : Teaching : Courses : Asr 

Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK
Tel: +44 131 651 5661, Fax: +44 131 651 1426, E-mail: school-office@inf.ed.ac.uk
Please contact our webadmin with any comments or corrections. Logging and Cookies
Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh