Report EDI-INF-RR-1346

Informatics Report Series

Report

EDI-INF-RR-1346

Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home

Title:A Novel Method for Learning Policies from Variable Constraint Data

Authors: Matthew Howard ; Stefan Klanke ; Michael Gienger ; Christian Goerick ; Sethu Vijayakumar

Date:Aug 2009

Publication Title:Autonomous Robots

Publisher:Springer

Publication Type:Journal Article Publication Status:Published

Volume No:27 Page Nos:105-121

DOI:10.1007/s10514-009-9129-8

Abstract:: Many everyday human skills can be framed in terms of performing some task subject to constraints imposed by the environment. Constraints are usually unobservable and frequently change between contexts. In this paper, we present a novel approach for learning (unconstrained) control policies from movement data, where observations come from movements under different constraints. As a key ingredient, we introduce a small but highly effective modification to the standard risk functional, allowing us to make a meaningful comparison between the estimated policy and constrained observations. We demonstrate our approach on systems of varying complexity, including kinematic data from the ASIMO humanoid robot with 27 degrees of freedom, and present results for learning from human demonstration.
Copyright:: 2009 by The University of Edinburgh. All Rights Reserved

Links To Paper
1st Link

Bibtex format
@Article{EDI-INF-RR-1346,: author = { Matthew Howard and Stefan Klanke and Michael Gienger and Christian Goerick and Sethu Vijayakumar },; title = {A Novel Method for Learning Policies from Variable Constraint Data},; journal = {Autonomous Robots},; publisher = {Springer},; year = 2009,; month = {Aug},; volume = {27},; pages = {105-121},; doi = {10.1007/s10514-009-9129-8},; url = {http://www.springerlink.com/content/r5u85525p6171g17/},
}

Home : Publications : Report

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh