Informatics Report Series


Report EDI-INF-RR-0801


Title: Rule-Based Chunking and Reusability
Authors: Claire Grover; Richard Tobin
Date: May 2006
Publication Title: Proceedings of LREC 2006 (Conference on Language Resources and Evaluation)
Publication Type: Conference Paper
Publication Status: Published
Abstract:
In this paper we discuss a rule-based approach to chunking implemented using the LT-XML2 and LT-TTT2 tools. We describe the tools and the pipeline and grammars that have been developed for the task of chunking. We show that our rule-based approach is easy to adapt to different chunking styles and that the mark-up of further linguistic information such as nominal and verbal heads can be added to the rules at little extra cost. We evaluate our chunker against the CoNLL 2000 data and discuss discrepancies between our output and the CoNLL mark-up as well as discrepancies within the CoNLL data itself. We contrast our results with the higher scores obtained using machine learning and argue that the portability and flexibility of our approach still make it a more practical solution.
Copyright:
2006 by ELRA. All Rights Reserved
BibTeX format
@InProceedings{EDI-INF-RR-0801,
author = {Claire Grover and Richard Tobin},
title = {Rule-Based Chunking and Reusability},
booktitle = {Proceedings of LREC 2006 (Conference on Language Resources and Evaluation)},
year = 2006,
month = {May},
}



Unless explicitly stated otherwise, all material is copyright The University of Edinburgh