Informatics Report Series


Report   

EDI-INF-RR-0888


Related Pages

Report (by Number) Index
Report (by Date) Index
Author Index
Institute Index

Home
Title:XML-Based Data Preparation for Robust Deep Parsing
Authors: Claire Grover ; Alex Lascarides
Date:Jul 2001
Publication Title:Proceedings of the Joint EACL-ACL Meeting (ACL-EACL 2001)
Publication Type:Conference Paper Publication Status:Published
DOI:10.3115/1073012.1073046
Abstract:
We describe the use of XML tokenisation, tagging and mark-up tools to prepare a corpus for parsing. Our techniques are generally applicable but here we focus on parsing Medline abstracts with the ANLT wide-coverage grammar. Hand-crafted grammars inevitably lack coverage but many coverage failures are due to inadequacies of their lexicons. We describe a method of gaining a degree of robustness by interfacing POS tag information with the existing lexicon. We also show that XML tools provide a sophisticated approach to pre-processing, helping to ameliorate the messiness in real language data and improve parse performance.
Links To Paper
1st Link
Bibtex format
@InProceedings{EDI-INF-RR-0888,
author = { Claire Grover and Alex Lascarides },
title = {XML-Based Data Preparation for Robust Deep Parsing},
book title = {Proceedings of the Joint EACL-ACL Meeting (ACL-EACL 2001)},
year = 2001,
month = {Jul},
doi = {10.3115/1073012.1073046},
url = {http://acl.ldc.upenn.edu/P/P01/P01-1034.pdf},
}


Home : Publications : Report 

Please mail <reports@inf.ed.ac.uk> with any changes or corrections.
Unless explicitly stated otherwise, all material is copyright The University of Edinburgh