- Abstract:
We propose the asynchronous-transition HMM (AT-HMM), which is based on asynchronous transition structures among the individual features of acoustic feature vector sequences. A conventional HMM represents vector sequences with a chain of states, where each state holds a multi-dimensional output distribution; it therefore assumes that all features change synchronously. This assumption appears over-simplified for modeling the temporal behavior of acoustic features, since the cepstrum and its time derivative cannot change synchronously with each other. In a speaker-dependent continuous phoneme recognition task, the AT-HMMs reduced errors by 10% to 40%. In a speaker-independent task, the performance of the AT-HMMs was comparable to that of conventional HMMs.
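To make the synchrony assumption concrete, the sketch below contrasts the conventional observation model with one illustrative reading of the asynchronous-transition idea. The per-feature state sequences q^{(d)} and the diagonal Gaussian outputs are assumptions made here for illustration only, not the paper's exact formulation of the AT-HMM.

    % Conventional HMM: a single state sequence q_{1:T} is shared by all D
    % feature dimensions, so every dimension switches its output distribution
    % at the same time instants (synchronous transitions).
    P(\mathbf{o}_{1:T}) = \sum_{q_{1:T}} \pi_{q_1} \prod_{t=2}^{T} a_{q_{t-1} q_t}
        \prod_{t=1}^{T} \prod_{d=1}^{D}
        \mathcal{N}\bigl(o_{t,d};\, \mu_{q_t,d},\, \sigma^2_{q_t,d}\bigr)

    % Illustrative asynchronous-transition reading (an assumption, not the
    % authors' definition): each feature dimension d follows its own state
    % sequence q^{(d)}_{1:T}, so state changes need not coincide across
    % dimensions (e.g. the cepstrum and its time derivative).
    P(\mathbf{o}_{1:T}) = \prod_{d=1}^{D} \sum_{q^{(d)}_{1:T}}
        \pi^{(d)}_{q^{(d)}_1} \prod_{t=2}^{T} a^{(d)}_{q^{(d)}_{t-1} q^{(d)}_{t}}
        \prod_{t=1}^{T}
        \mathcal{N}\bigl(o_{t,d};\, \mu^{(d)}_{q^{(d)}_{t}},\, \sigma^{2\,(d)}_{q^{(d)}_{t}}\bigr)

In this reading, the gain comes from letting different feature streams change their output distributions at different frames, which the shared-state form above cannot express.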
- Links To Paper
- This URL is for the conference paper containing a subset of the full paper in Japanese.
- BibTeX format
@Article{EDI-INF-RR-0673,
  author  = {Shigeki Matsuda and Mitsuru Nakai and Hiroshi Shimodaira and Shigeki Sagayama},
  title   = {Speech Recognition Using Asynchronous Transition {HMM}},
  journal = {IEICE Trans. (D-II)},
  year    = 2003,
  month   = {Jun},
  volume  = {J86-D-II},
  number  = {6},
  pages   = {741--754},
  url     = {http://intl.ieeexplore.ieee.org/xpl/abs_free.jsp?arNumber=859132},
}