Informatics Report Series


Report   

EDI-INF-RR-1103


Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home
Title:Learning Depth First-Search: a Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings and its Application to MDPs
Authors: B. Bonet ; Hector Geffner
Date: 2006
Publication Title:Proc. 16th Int. Conf. on Automated Planning and Scheduling (ICAPS-06)
Publisher:AAAI Press
Publication Type:Conference Paper
Page Nos:142-151
ISBN/ISSN:9781577352
Abstract:
Dynamic Programming provides a convenient and unified framework for studying many state models used in AI but no algorithms for handling large spaces. Heuristic-search methods, on the other hand, can handle large spaces but lack a common foundation. In this work, we combine the benefits of a general dynamic programming formulation with the power of heuristic-search techniques for developing an algorithmic framework, that we call Learning Depth-First Search, that aims to be both general and effective. LDFS is a simple piece of code that performs iterated depth-first searches enhanced with learning. For deterministic actions and monotone value functions, LDFS reduces to IDA* with transposition tables, while for Game Trees, to the state-of-the-art iterated Alpha-Beta search algorithm with Null Windows known as MTD. For other models, like AND/OR graphs and MDPs, LDFS yields new, simple, and competitive algorithms. We show this here for MDPs.
Links To Paper
No links available
Bibtex format
@InProceedings{EDI-INF-RR-1103,
author = { B. Bonet and Hector Geffner },
title = {Learning Depth First-Search: a Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings and its Application to MDPs},
book title = {Proc. 16th Int. Conf. on Automated Planning and Scheduling (ICAPS-06)},
publisher = {AAAI Press},
year = 2006,
pages = {142-151},
}


Home : Publications : Report 

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh