Informatics Report Series



Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Title:Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-based Speech Synthesis
Authors: Junichi Yamagishi ; Koji Onishi ; Takashi Masuko ; Takao Kobayashi
Date:Mar 2005
Publication Title:IEICE TRANSACTIONS on Information and Systems
Publisher:Oxford University Press
Publication Type:Journal Article Publication Status:Published
Volume No:E88-D Page Nos:502-508
DOI:10.1093/ietisy/e88-d.3.502 ISBN/ISSN:09168532
This paper describes the modeling of various emotional expressions and speaking styles in synthetic speech using HMM-based speech synthesis. We show two methods for modeling speaking styles and emotional expressions. In the first method called style-dependent modeling, each speaking style and emotional expression is modeled individually. In the second one called style-mixed modeling, each speaking style and emotional expression is treated as one of contexts as well as phonetic, prosodic, and linguistic features, and all speaking styles and emotional expressions are modeled simultaneously by using a single acoustic model. We chose four styles of read speech - neutral, rough, joyful, and sad - and compared the above two modeling methods using these styles. The results of subjective evaluation tests show that both modeling methods have almost the same accuracy, and that it is possible to synthesize speech with the speaking style and emotional expression similar to those of the target speech. In a test of classification of styles in synthesized speech, more than 80% of speech samples generated using both the models were judged to be similar to the target styles. We also show that the style-mixed modeling method gives fewer output and duration distributions than the style-dependent modeling method.
Links To Paper
Bibtex format
author = { Junichi Yamagishi and Koji Onishi and Takashi Masuko and Takao Kobayashi },
title = {Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-based Speech Synthesis},
journal = {IEICE TRANSACTIONS on Information and Systems},
publisher = {Oxford University Press},
year = 2005,
month = {Mar},
volume = {E88-D},
pages = {502-508},
doi = {10.1093/ietisy/e88-d.3.502},
url = {},

Home : Publications : Report 

Please mail <> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh