ASR 2018-19

Lecture 9 - Neural Network Acoustic Models 3: CD DNNs and TDNNs

This lecture discusses the DNN- and TDNN-based acoustic models used in state-of-the-art systems. These are context-dependent deep neural networks with very wide output layers (typically 10,000 or more units), in which each output unit corresponds to a context-dependent tied state of an HMM/GMM system. We also introduce the time-delay neural network (TDNN), in which each hidden layer processes a window of frames from the previous layer, so that the network can learn wide receptive fields over the input layer.
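To make the idea of a receptive field built from stacked windows concrete, here is a minimal sketch in Python. It assumes each layer is described by a list of frame offsets that it splices together from the layer below; the function names and the particular offsets are illustrative assumptions, loosely in the style of the sub-sampled windows used in TDNNs, and are not taken from the lecture slides.

```python
def receptive_field(layer_offsets):
    """Return (left, right) input context, in frames, seen by one output frame.

    layer_offsets: one list of integer frame offsets per layer, relative to
    the frame being computed (hypothetical configuration format).
    """
    # Each layer extends the context by its most negative and most positive offset.
    left = sum(min(offsets) for offsets in layer_offsets)
    right = sum(max(offsets) for offsets in layer_offsets)
    return left, right


def input_offsets(layer_offsets):
    """Return the sorted input-frame offsets that one output frame depends on."""
    reached = {0}
    # Walk from the output layer down to the input, expanding each reached
    # frame by the offsets that layer reads from the layer below.
    for offsets in reversed(layer_offsets):
        reached = {r + o for r in reached for o in offsets}
    return sorted(reached)


# Illustrative offsets (an assumption): a dense window at the bottom,
# sparser and wider windows higher up.
layers = [
    [-2, -1, 0, 1, 2],
    [-1, 2],
    [-3, 3],
    [-7, 2],
    [0],
]

print(receptive_field(layers))     # (-13, 9)
print(len(input_offsets(layers)))  # 23 input frames in total
```

With these offsets, no single layer looks at more than five frames, yet one output frame depends on 23 input frames (13 to the left and 9 to the right).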

There are surprisingly few clear and comprehensive recent articles about DNN acoustic models. The best is probably Maas et al (2017), Building DNN acoustic models for large vocabulary speech recognition. For TDNNs, the best paper to read is Peddinti et al (2015), A time delay neural network architecture for efficient modeling of long temporal contexts.

Context-dependent DNNs


Copyright (c) University of Edinburgh 2015-2019
The ASR course material is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License
This page maintained by Steve Renals.
Last updated: 2019/04/17 13:53:16UTC

