Informatics Report Series


Report   

EDI-INF-RR-0899


Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home
Title:Learning More Effective Dialogue Strategies Using Limited Dialogue Move Features
Authors: Matthew Frampton ; Oliver Lemon
Date: 2006
Publication Title:Association for Computational Lingusitics
Publication Type:Conference Paper Publication Status:Published
Page Nos:185-192
DOI:10.3115/1220175.1220199
Abstract:
We explore the use of restricted dialogue contexts in reinforcement learning (RL) of effective dialogue strategies for information seeking spoken dialogue systems (e.g. COMMUNICATOR). The contexts we use are richer than previous research in this area, which use only slot-based information, but are much less complex than the full dialogue ``Information States'' explored in Henderson et a. 2005, for which tractability is an issue. We explore how incrementally adding richer features allows learning of more effective dialogue strategies. We use 2 user simulations learned from COMMUNICATOR data to explore the effects of different features on learned dialogue strategies. Our results show that adding the last system and user dialogue moves increases the average reward of the automatically learned strategies by 65.9% over the original (hand-coded) COMMUNICATOR systems, and by 7.8% over a baseline RL policy that uses only slot-status features. We show that the new strategies exhibit a ``focus switching'' strategy and effective use of the `give help' action.
Links To Paper
Author homepage
Project website
Bibtex format
@InProceedings{EDI-INF-RR-0899,
author = { Matthew Frampton and Oliver Lemon },
title = {Learning More Effective Dialogue Strategies Using Limited Dialogue Move Features},
book title = {Association for Computational Lingusitics},
year = 2006,
pages = {185-192},
doi = {10.3115/1220175.1220199},
url = {http://homepages.inf.ed.ac.uk/olemon/},
}


Home : Publications : Report 

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh