Report EDI-INF-RR-1103

Informatics Report Series

Report

EDI-INF-RR-1103

Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home

Title:Learning Depth First-Search: a Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings and its Application to MDPs

Authors: B. Bonet ; Hector Geffner

Date: 2006

Publication Title:Proc. 16th Int. Conf. on Automated Planning and Scheduling (ICAPS-06)

Publisher:AAAI Press

Publication Type:Conference Paper

Page Nos:142-151

ISBN/ISSN:9781577352

Abstract:: Dynamic Programming provides a convenient and unified framework for studying many state models used in AI but no algorithms for handling large spaces. Heuristic-search methods, on the other hand, can handle large spaces but lack a common foundation. In this work, we combine the benefits of a general dynamic programming formulation with the power of heuristic-search techniques for developing an algorithmic framework, that we call Learning Depth-First Search, that aims to be both general and effective. LDFS is a simple piece of code that performs iterated depth-first searches enhanced with learning. For deterministic actions and monotone value functions, LDFS reduces to IDA* with transposition tables, while for Game Trees, to the state-of-the-art iterated Alpha-Beta search algorithm with Null Windows known as MTD. For other models, like AND/OR graphs and MDPs, LDFS yields new, simple, and competitive algorithms. We show this here for MDPs.

Links To Paper: No links available

Bibtex format
@InProceedings{EDI-INF-RR-1103,: author = { B. Bonet and Hector Geffner },; title = {Learning Depth First-Search: a Unified Approach to Heuristic Search in Deterministic and Non-Deterministic Settings and its Application to MDPs},; book title = {Proc. 16th Int. Conf. on Automated Planning and Scheduling (ICAPS-06)},; publisher = {AAAI Press},; year = 2006,; pages = {142-151},
}

Home : Publications : Report

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh