ASR 2018-19

Lecture 16 - End-to-end systems 2: Sequence-to-sequence models

In this lecture we reviewed the pros and cons of CTC-based systems and looked at two sequence-to-sequence models: the RNN transducer and the attention-based encoder-decoder.

Probably the best reading on this is the Google paper "Listen, Attend and Spell" by Chan et al., along with the Interspeech 2017 paper by Prabhavalkar et al., "A Comparison of Sequence-to-Sequence Models for Speech Recognition".

CTC recap

RNN Transducer

Attention-based encoder-decoder


Copyright (c) University of Edinburgh 2015-2019
The ASR course material is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License
This page maintained by Steve Renals.
Last updated: 2019/04/24 17:50:47UTC

