Informatics Report Series
|
|
|
|
|
|
Title:XML-Based Data Preparation for Robust Deep Parsing |
Authors:
Claire Grover
; Alex Lascarides
|
Date:Jul 2001 |
Publication Title:Proceedings of the Joint EACL-ACL Meeting (ACL-EACL 2001) |
Publication Type:Conference Paper
Publication Status:Published
|
|
DOI:10.3115/1073012.1073046
|
- Abstract:
-
We describe the use of XML tokenisation, tagging and mark-up tools to prepare a corpus for parsing. Our techniques are generally applicable but here we focus on parsing Medline abstracts with the ANLT wide-coverage grammar. Hand-crafted grammars inevitably lack coverage but many coverage failures are due to inadequacies of their lexicons. We describe a method of gaining a degree of robustness by interfacing POS tag information with the existing lexicon. We also show that XML tools provide a sophisticated approach to pre-processing, helping to ameliorate the messiness in real language data and improve parse performance.
- Links To Paper
- 1st Link
- Bibtex format
- @InProceedings{EDI-INF-RR-0888,
- author = {
Claire Grover
and Alex Lascarides
},
- title = {XML-Based Data Preparation for Robust Deep Parsing},
- book title = {Proceedings of the Joint EACL-ACL Meeting (ACL-EACL 2001)},
- year = 2001,
- month = {Jul},
- doi = {10.3115/1073012.1073046},
- url = {http://acl.ldc.upenn.edu/P/P01/P01-1034.pdf},
- }
|